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INTRODUCTION 


The  general  continuous  linear  program  is  formulated  as 

follows: 

T 

minimize  / c(t)  x(t)dt 

0 


subject  to 

t 

B(t)  x(t)  + / K(t,s)  x(s)ds  = b(t) 
0 

x(t)  >0,  t € [0, T] 


where  c(t),  b(t)  and  B(t),  K(t, s)  are  given  and  are  real  vectors 
and  matrices  respectively. 

A special  case  of  this  is  a linear  optimal  control  problem 
with  constraints  on  both  the  state  and  control  variables: 

T 

minimize  / {c(t)  x(t)  + d(t)  u(t)]dt 

0 

subject  to  ^ x(t)  = A(t)  x(t)  + B(t)  u(t)  + a(t) 

0 = C(t)  x(t)  + D(t)  u(t)  + b(t) 
x(t)  > 0,  u(t)  >0,  t € [0,T] 
x(0)  given. 
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The  motivation  for  studying  such  problems  is  the  following; 

(i)  They  are  widely  applicable  to  many  real  world  situations  as 

intertemporal  economic  models  of,  say,  investment  and  planning 
(e.g.  [5]  and  [7]),  and  occur  frequently  in  engineering 
applications  (e.g.  [2^]). 

(ii)  They  are  closely  related  to  their  discrete  time  counterparts 
which,  formulated  as  block  triangular  linear  programs,  are 
costly  to  solve.  Computational  experience  with  such  linear 
programming  models  shows  that  they  often  require  unusually 
many  simplex  iterations  [1],  [7],'*’  and  also  become  very  large 
in  size.  On  the  other  hand,  many  small  continuous  time  problems 
have  been  observed  to  have  nice  mathematical  properties,  and 
exact  solutions  have  been  easily  obtainable  by  hand  [12],  [ lB ] . 
This  suggests  that  a thorough  understanding  of  continuous 
linear  programs  may  not  only  result  in  methods  that  can  solve 
them  directly  and  efficiently  on  a computer,  but  also  result 
in  new  improved  methods  for  solving  the  discrete  time 
formulations. 

(iii)  Many  continuous  time  problems  are  inherently  numerically  un- 
stable when  solved  in  discrete  time  as  linear  programs.  Indeed, 
one  can  construct  examples  which  are  easily  solved  in  continuous 
time,  but  which,  when  discretized  with  time  step  e,  have 

^This  seems  to  be  attributable  to  the  observed  persistence  property 
of  these  models  [8],  [21],  i.e.  similar  type  activities  persist 
in  the  basis  for  several  consecutive  time  periods.  In  the  continuous 
case  this  can  be  interpreted  as  activities  remaining  positive  over 
intervals  of  time. 


corresponding  linear  programming  bases  with  condition  numbers 
proportional  to  e , where  m is  the  number  of  rows  in  the 
continuous  time  problem.  In  such  cases,  some  other  solution 
technique  becomes  a necessity  for  any  appreciably  small  e. 

Continuous  linear  programs  have  been  largely  studied  as  linear 
programs  in  a function  space  with  the  emphasis  on  generalizing  the 
simple  but  powerful  results  of  linear  programming.  They  were  first 
considered  in  1953  by  Bellman  [2], [3]  who  established  a weak  duality 
result  which,  as  a sufficient  condition  for  optimality,  could  be  used 
to  obtain  optimal  solutions  by  "good  guesswork."  Since  then  work  has 
been  mainly  in  two  areas:  strong  duality  theorems  and  computational 
methods. 

The  first  strong  duality  theorem  was  established  by  Tyndall  [26] 
in  1965,  and  subsequently  strengthened  by  a number  of  authors,  e.g. 
Grinold  [ 14]  and  Levine  and  Pomerol  [19].  These  authors  imposed  alge- 
braic conditions  on  the  coefficients  defining  the  problem  so  as  to 
directly  generalize  the  strong  duality  theorem  of  linear  programming  [6] 
to  the  case  of  continuous  linear  programs  in  the  space  of  bounded  measur- 
able functions.  Another  strong  duality  theorem  requiring  a Slater  con- 
dition to  be  satisfied  was  obtained  by  Hager  and  Mitter  [15]  for  a 
variant  of  the  above  optimal  control  formulation. 

The  dominant  theme  on  the  computational  side  has  been  the  attempt 
to  generalize  the  simplex  method  to  a function  space.  In  1964  Dantzig 
[9]  showed  that  the  control  theory  formulation  with  no  state  variable 
constraints  could  be  solved  using  the  Dantzig-Wolfe  decomposition 
principle.  Work  on  the  general  problem  was  done  first  by  Lehman  [18] 
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in  1954  and  then  pursued  by  Drews,  Hartberger  and  Segers  [12]  in  1970. 
Their  central  idea  was  to  mirror  the  revised  simplex  method  step-by-step 
in  continuous  time.  Thus  they  would  begin  with  a "basic  feasible  solu- 
tion" which,  roughly  speaking  is  a solution  whose  positive  values  are 
uniquely  determined  once  the  remaining  ones  are  held  fixed  at  zero.  The 
first  step  would  be  to  compute  relative  "prices"  backwards  in  time  by 
requiring  complementary  slackness  conditions  to  hold.  These  prices 
would  then  be  used  to  identify  some  variable  that  is  currently  at  zero 
on  some  subinterval  of  [0,T],  and  which  when  increased  over  an  interval 
would  yield  an  improved  solution.  This  increase  would  be  made  in  such  a 
way  that  the  new  solution  was  again  a "basic  feasible  solution." 

Teren  [24]  in  1977  developed  a similar  algorithm  for  the  optimal 
control  formulation  allowing,  in  addition,  for  end  point  constraints. 
However,  his  regularity  assumptions  are  very  restrictive. 

The  role  of  the  above  works,  [18],  [24]  and  especially  [12], 
has  been  to  demonstrate  that  the  concepts  and  steps  of  the  simplex 
method  can  be  directly  generalized  to  a function  space  setting.  However, 
many  questions  are  left  unanswered,  and  some  of  their  suggested  con- 
structions need  to  be  made  more  precise  if  they  are  to  be  developed 
with  mathematical  rigor. 

In  this  dissertation,  we  consider  some  of  the  foundational 
aspects  of  a theory  of  a continuous  time  simplex  method  in  the  same 
spirit  as  [12]  and  [18]. 


The  first  task  we  address  is  that  of  obtaining  a useful  character- 
ization of  extreme  point  solutions.  This  is  achieved,  in  Chapter  2, 
in  the  case  of  constant  coefficients  and  in  the  class  of  so-called 


right  analytic  solutions.  The  main  result  there  is  that  there  is  a 
one-to-one  correspondence  between  extreme  points  and  solutions 
satisfying  certain  full  rank  conditions,  in  a manner  reminiscent  of  the 
analogous  characterization  in  linear  programming,  and  also  the  bang- 
bang  principle  of  optimal  control  theory.  This  characterization  is 
used  to  define  a "basic  feasible  solution"  and  is  the  foundation  of 
the  proposed  continuous  time  simplex  method  in  Chapter  4.  Examples 
are  also  presented  to  show  that  this  kind  of  characterization  cannot 
hold  for  the  general  (time  varying  coefficients)  case.  This  gives 
some  indication  that  it  might  be  meaningless  to  look  for  a general 
continuous  time  simplex  method. 

In  Chapter  3 we  motivate  the  approach  to  be  taken  in  Chapter  4. 
This  is  done  by  first  reviewing  the  important  steps  in  the  simplex 
method,  and  then  analyzing  an  example  in  both  continuous  and  discrete 
time. 

Chapter  4 is  chiefly  concerned  with  the  problem  of  how  to  move 
from  one  basic  feasible  solution  to  an  improved  one.  The  first 
aspect  considered  is  the  statement  of  the  optimality  conditions.  It 
is  well  known,  e.g.  [12],  [l8],  that  the  dual  problem  need  not  have 
an  optimal  solution  in  any  space  of  real  valued  functions  even  though 
the  primal  may  be  bounded  and  possess  a unique  analytic  optimal 
solution.  While  the  existence  of  simple  examples  to  this  effect 
motivated  the  search  for  conditions  on  the  coefficients  where  this 
would  not  occur,  e.g.  [l4],  [26],  it  also  simultaneously  yielded  the 
realization  that  any  continuous  time  simplex  method  would  have  to  be 
able  to  accommodate  these  cases,  [12],  [18],  Indeed, working  in  the 
larger  function  space  of  distributions^"  which  contains  the  Dirac 
''"See  Appendix  C. 
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5 functional  and  its  higher  order  derivatives  allowed  these  dual 


problems  to  attain  their  optima.  However,  interpreting  a dual 
infeasibility  (related  to  a statement  of  non-optimality)  in  the  space 
of  distributions  requires  some  care,  and  this  is  an  aspect  ignored 
in  [12]  and  [18], 

We  here  take  the  view  that  it  is  reasonable  to  assume 
that  the  primal  problem  is  bounded.  This  means  that  the  need 
to  work  with  distributions  arises  only  in  the  dual.  We  then 
formulate  a new  dual  problem  with  all  quantities  being  real  valued. 
This  dual  is  shown  to  be  equivalent  to  a formulation  in  the  space 
of  distributions  that  allows  only  finitely  many  occurrences  of  the  5 
and  its  derivatives,  but  which  nevertheless  can  be  studied  in  an 
entirely  distribution  free  context.  The  usual  weak  duality  theorem 
is  established  and  sufficient  optimality  conditions  deduced. 

Next  is  presented  a new  construction  which,  under  certain  non- 
degeneracy and  regularity  assumptions,  will  begin  with  a given  extreme 
point  solution  and  from  there  continuously  trace  out  a path  of  extreme 
points  along  which  descent  is  obtained  for  at  least  a short  while, 
j't  differs  fundamentally  from  the  usual  basis  change  in  linear 
programming  in  that  we  are  here  simultaneously  increasing  nonbasic 
variables  having  both  negative  and  positive  reduced  costs,  but  so 
that  the  overall  result  still  gives  descent.  The  approach  taken  here 
was  inspired  by  Dantzig  [10]  who  suggested  that  dual  shadow  prices 
could  be  used  to  find  which  nonbasic  variables  to  introduce  into  the 
basic  set  because  their  values  would  dominate  any  changes  in  the 
objective  due  to  corrections  in  the  location  of  the  breakpoints  in 
order  to  maintain  feasibility.  This  thesis  gives  conditions  (and 
proofs)  for  which  his  conjecture  holds. 
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Chapter  5 contains  the  conclusions  and  mention  of  some  of  the 
many  remaining  unanswered  questions.  The  appendices  contain  examples, 
background  material  on  distributions  and  Laplace  transforms,  and  also 
technical  lemmas,  some  of  which  are  of  independent  interest. 

Appendix  D in  particular  contains  an  interesting  existence  and  unique- 
ness result  for  a set  of  simultaneous  nonlinear  equations,  the  solutions 
to  which  are  the  breakpoints  in  one  of  the  basis  change  constructions. 

We  conclude  this  introduction  with  a remark  on  the  optimal 
control  problem  formulated  earlier.  Because  of  the  presence  of  mixed 
control  and  state  variable  constraints,  the  problem  is  very  different 
in  character  from  and  substantially  more  difficult  to  solve  than  the 
usual  control  problem  having  the  state  variables  unrestricted.  See 
e.g.  [13],  Indeed,  the  distinction  between  state  and  control  variables 
becomes  artificial,  in  general  yielding  no  additional  insight.  When 
it  is  reformulated  as  a continuous  linear  program, 1 the  state  and 
control  variables  are  lumped  together  in  a single  vector.  Thus,  rather 
than  thinking  of  choosing  a control  to  steer  the  system  along  a certain 
trajectory,  we  instead  take  the  combinatorial  approach  of  choosing  the 
active  constraint  sets  from  amongst  both  the  state  and  control 
variables. 


1. 1.  Notation 

Let  n denote  { 1, 2, , . . , n) . 

For  p C n,  let  |g|  be  the  cardinality  of  g. 


For  an  m x n 

matrix.  A, 

and  a C m,  PC 

n. 

let 

aq. 

denote 

the  submatrix  of  A 

whose  rows 

are  indexed  by 

a. 

let 

% 

denote 

"'"By  integrating  both  sides  of  the  system  equation. 
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the  submatrix  of  A whose  columns  are  indexed  by  P;  let  denote 

the  ith  column  of  A,  and  a. . its  (i, j)th  element.  All  matrices 

will  be  denoted  by  upper  case  letters. 

Vectors  will  be  denoted  by  lower  case  letters,  with  the 

distinction  between  row  and  column  vectors  being  omitted  when  the 

meaning  is  clear  from  the  context.  When  necessary  the  transpose 

T 

symbol  will  be  used  to  indicate  a row  vector,  b , for  some  column 
vector  b.  The  components  of  a vector  b will  be  denoted  by  b^. 

The  distinction  between  this  and  the  ith  column  of  the  matrix  B 
will  be  clear  from  the  context. 

e^  will  denote  the  pth  column  of  the  identity  matrix,  and  e 
will  be  used  to  denote  a vector  of  ones. 

g^(t)  will  denote  the  ith  derivative  of  g evaluated  at 
t.  g^(t+)  will  denote  the  ith  right  derivative  of  g,  and  similarly 
for  g(l)(t-). 

||"  ||  will  denote  the  Euclidean  norm. 

If  I = (t',t")  is  an  open  interval  then  I will  denote  the 
interval  [t’,t"). 

R and  (E  will  denote  the  reals  and  complexes  respectively. 
L^fOjT]  will  denote  the  space  of  all  real  valued,  Lebesgue 
measurable,  essentially  bounded  functions  on  [ 0, T] . L^[0,T]  will 

denote  the  space  of  real  valued  Lebesgue  integrable  functions  on 
[0,T].  When  the  time  interval  is  clear  from  the  context,  these  will 

be  written  as  L and  L..  respectively. 

00  1 

C°°  will  denote  all  infinitely  differentiable  functions 


from  R -»R. 


1” 

* 


l+(t)  - 0 for  t < 0,  = 1 for  t > 0. 

Chapters  are  numbered  from  1 through  5,  with  sections  and 
subsections  denoted  by  4.1,  4.2.1,  etc.  Appendices  are  numbered  A 
through  E with  subsections  likewise  denoted  by  C.l,  C.2,  etc. 
Equations,  lemmas,  propositions  and  theorems  are  all  part  of  the 
same  numbering  system  within  a chapter.  4(25),  4.2(25),  4.2.1(25) 
all  refer  to  the  same  equation  25  in  Chapter  4,  and  will  be  used  when 
referenced  from  another  chapter. 

[6]  will  denote  reference  number  6 in  the  bibliography. 
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CHAPTER  2 


ON  EXTREME  POINTS  AND  THEIR  CHARACTERIZATION 
AS  BASIC  FEASIBLE  SOLUTIONS 


The  simplex  method  of  linear  programming  is  based  partly  on 
the  following  fundamental  results  [6], 

(i)  If  f;Rn  -» R is  linear,  and  f is  bounded  below  on 

P = {x  6 Rn:Ax  = b,  x > 0} 

where  A € Rmxn  and  b € Rm,  then  f attains  its  infimum 

on  P at  an  extreme  point  of  P. 

(ii)  x e p is  an  extreme  point  of  P if  and  only  if  A.  has  full 

P 

column  rank,  where  3 = {i;x^  > 0). 

Because  of  the  characterization  in  (ii),  extreme  points  of 
polyhedral  sets  are  also  called  'basic  feasible  solutions.'  Note 
that  there  can  be  at  most  m positive  components  in  any  basic  fir.sible 
solution. 

In  this  chapter  we  shall  investigate  to  what  extent  similar 
statements  may  be  made  in  the  context  of  continuous  linear  programs. 

We  shall  begin  with  the  general  case,  and  then  specialize  various 
aspects  of  the  problem. 

The  general  problem  under  consideration  is 
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Minimize 


r 


(l) 


T 


f(x)  = / 
0 


c(t)  x(t)dt 


Subject  to 


t 

B(t)x(t)  + / K(t,s)  x(s)ds  = b(t) 
0 


x(t)  > 0 


where  K(t,s),  B(t)  £ RmXn,  x(t),  c(t)  £ Rn,  and  b(t)  £ Rm  for 
all  s,  t £ [0,T],  s < t. 

It  will  be  useful  to  write  the  constraints  in  operator 
shorthand: 

Ax  = b,  x > 0 

where  A maps  the  function  x to  another  function  denoted  by  Ax 
and  defined  by 

t 

(Ax)(t)  = B(t)x(t)  + / K(t,s)  x(s)  ds. 

0 

In  order  to  complete  the  problem  definition  we  need  to  specify 

the  spaces  in  which  the  variables  and  coefficients  lie,  e.g.  L . 

00 

This  will  be  done  when  required. 

For  a given  space  X and  constraint  coefficients  B(.), 

K(*,  •)  and  b(*)  define 

P(X)  = {x  £ Xn:Ax  = b,  x > 0) 

where  Xn  denotes  all  n-vectors  of  elements  in  X.  P(X)  is  easily 

seen  to  be  convex,  and  will  be  assumed  nonempty. 

11 


1 


We  shall  identify  any  two  functions  that  are  equal  a.e.,  and 
likewise  require  the  constraints  to  be  satisfied  a.e.  In  particular, 
if  x is  an  extreme  point  of  P(X)  and  y = x,  a.e.,  then  y is 
also  an  extreme  point  of  P(X). 


2.1.  Extreme  Points  as  Optimal  Solutions 

The  result  that  would  be  most  desirable  is  the  following: 

if  X is  a given  space  of  'nice'  functions,  and  if  a continuous 

linear  functional  f is  bounded  below  on  P(X)  then  f attains  its 

infimum  on  P(X)  and  moreover  does  so  at  an  extreme  point  of  P(X). 

However,  without  severe  pre-conditions  on  the  coefficients  defining 

the  problem,  the  only  case  when  this  appears  possible  is  when  X = L 

00 

and  P(L  ) is  bounded.  One  complicating  factor  is  that  while  the 
00 

notion  of  extreme  points  is  purely  algebraic,  one  seems  to  require 
heavy  topological  machinery  to  establish  merely  their  existence. 

(2)  Theorem.  If  the  components  of  c(-)>  b(*)>  B( • ) and  K(*,*) 
are  all  in  L^,  and  there  is  an  M > 0 such  that  x € PCL^)  =*  ||x(t)||<M 

a.e.  then  f(*)  attains  its  infimum  at  an  extreme  point  of  P(L  ). 

00 

The  proof  of  this  result  will  follow  immediately  from  the 
following  well-known  lemma  once  the  compactness  of  PCL^)  is 
established  in  a suitable  topology. 
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(3)  Lemma.  Let  Y be  a locally  convex  Hausdorff  space  and  let 
f:Y  ->  R be  concave.  If  Q C Y is  compact  and  f is  lower  semi- 
continuous  on  Q then  f attains  its  infimum  on  Q at  an  extreme 
point  of  Q. 

Proof.  See  [16,  p.  7M.  □ 


Under  the  conditions  of  Theorem  (2) 

(i)  P(Lro)  is  weakly  compact  as  a subset  of  L^. 

(ii)  P(Lot)  is  weak  * compact.1- 

Proof. 

(i)  See  [lU,  p.  4o]. 

(ii)  The  proof  of  this  is  in  the  same  spirit  as  that  of  (i)  and  will 
be  omitted.  □ 


Proof  of  Theorem  (2) . 


Let  Y be  the  space  L“  with  the  weak  topology.  By  Lemma 

(U)  P(L  ) is  compact  in  Y.  Since  c £ Ln,  f is  continuous  on  Y. 
00  00 

Noting  that  Y is  a locally  convex  Hausdorff  space,  we  can  apply 
Lemma  (3),  and  the  proof  is  complete.  □ 

H’or  interest  sake  we  state  both  compactness  results  even  though 
only  one  is  required. 


Remark.  The  same  proof  is  valid  if  we  choose  Y to  be  Ln  with 
00 

the  weak  * topology. 

In  practice  we  would  like  to  have  optimal  solutions  that  are 
more  manageable  than  general  measurable  solutions,  for  example  piecewise 
analytic  solutions  having  only  finitely  many  breakpoints.  Theorem  (2) 
unfortunately  is  the  best  statement  available  and  it  is  still  an  open 
question  whether  or  not  it  can  be  improved  upon  even  in  very  special 
cases.  Even  if  we  know  that  the  optimum  has,  say,  a piecewise  analytic 
solution,  there  is  no  guarantee  that  there  is  a piecewise  analytic 
extreme  point  solution.  However,  there  is  a motivation  for  continuing 
the  study  of  extreme  points  in  more  useful  spaces,  given  by  the 
following  simple  result. 

(5)  Proposition.  If  f:Xn  -> R is  concave  and  x is  the  unique 
minimizer  of  f over  P(X),  then  x is  an  extreme  point  of  P(X). 

Proof.  This  follows  immediately  from  the  concavity  of  f and  the 
definition  of  an  extreme  point.  □ 


2.2.  Extreme  Points  as  Basic  Solutions 

Let  us  begin  by  reviewing  the  proof  of  the  characterization 
of  extreme  points  in  Rn  given  at  the  beginning  of  this  chapter. 

We  have  a given  x € Rn  satisfying 

Ax  = b,  x > 0. 
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The  necessary  and  sufficient  condition  for  x to  be  an  extreme  point 


i is  that  A.  Q has  full  column  rank,  where  0 = {i:x.  >0}.  . 

p 1 I 

To  show  the  sufficiency,  we  write  x = Ay  + (l-A)z  for 

I 

A £ (0,1)  and  some  y and  z satisfying  the  constraints,  and  then 
show  that  x = y = x.  Set  a = =0).  XQ  = 0,  y > 0 and  z > 0 

implies  ya  = za  = °-  Therefore  A.^y^  = A.^z^  = A.^xp  = b.  Since  j 

A.q  has  full  column  rank  these  equations  have  a unique  solution,  and 
P 

we  are  done. 

To  show  the  necessity,  assume  that  A.^  does  not  have  full  \ 

column  rank.  Then  there  exists  a yQ  ^ 0 such  that  A.QyQ  = 0. 

P P P 

Setting  y =0  and  noting  that  x.  > 0,  there  is  a 0 > 0 such  : 

OL  p 

that  x + 0y  > 0,  x - 0y  > 0.  Hence  writing  x = ^ (x  +9y)  +^(x  -9 y) 

>j 

shows  that  x is  not  an  extreme  point. 

It  is  precisely  these  two  steps  that  we  shall  mirror  in  the 
continuous  case,  namely 

(i)  being  able  to  solve  uniquely  for  the  positive  components  when 
the  remaining  ones  are  held  at  zero,  and 
(ii)  being  able  to  perturb  the  positive  components  to  either  side  when 
they  are  not  uniquely  determined. 

Before  continuing,  we  remark  that  in  order  to  obtain  a similar 
characterization  in  a function  space  we  shall  have  to  severely  restrict 
the  class  of  admissible  solutions  as  well  as  the  constraint  coefficients. 

The  reason  for  this  is  that  the  constraints  in  (l)  can  have  extreme 
points  for  which  no  characterization  of  the  above  form  is  possible. 


Example  B(l)  Is  such  a case.  Also,  for  computational  purposes  it  would 

be  preferable  if  we  could  solve  locally  for  x(t)  at  time  t,  without 

any  considerations  of  the  future.  In  Example  B(l)  the  values  of  x 

on  [0,1)  are  determined  by  constraints  that  hold  over  the  interval 

[ 1, U) . In  Example  B(2)  the  slope  of  x at  0,  and  hence  the  whole 

solution,  is  determined  only  at  time  T by  the  restriction  x(T)  = 0. 

As  in  the  case  of  linear  programming  we  are  here  aiming  at 

the  following  restatement  of  the  continuous  linear  program  (l): 

Choose  a partition  of  [ 0, T ] into  time  intervals  (I.)  and  an 

J 

associated  partition  of  the  variables  {(a. ,3.))  such  that  with 

J J 

x =0  on  I.,  the  remaining  variables  x.  are  uniquely  determined, 
and  yield  the  optimal  solution.  By  our  comments  in  the  preceding 
paragraph,  we  moreover  want  the  value  of  x(t)  to  be  determined  only 
by  our  choice  of  which  variables  are  to  be  held  at  zero  on  [0, t], 
and  not  by  what  happens  to  the  right  of  t. 

In  the  remainder  of  this  chapter  we  shall  work  with  the  space 
of  right  analytic  functions,  defined  below. 

(6)  Definition.  A function  g:[0,T]  ->  R will  be  called  right  analytic 
if  for  each  t 6 [0,T),  there  is  an  e > 0 and  an  analytic  function 
h:(t-e,t+e)  -*  R such  that  g(s)  = h(s)  V s € [t,t+e). 

We  shall  let  &r  denote  the  space  of  bounded  right  analytic 
functions  on  [ 0, T ] . The  required  properties  of  these  functions  are 
established  in  Appendix  A.  Our  motivation  for  choosing  the  class  Q y 
is  that  it  seems  to  be  the  largest  class  for  which  the  local  uniqueness 
result,  Proposition  (9)  below,  can  be  established. 
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The  following  is  a key  result  in  the  subsequent  analysis. 


(7)  Lemma.  Let  g:[0,T]  -*  R have  right  analytic  components 

g^C-)?  i = 1, ...,n.  Then  there  exists  a (possibly  infinite)  disjoint 

family  of  open  intervals,  (1^),  such  that  U L = [0,T),^  and  such 

that  for  each  interval  I,,  each  g.  satisfies 

J 

(i)  g^  is  analytic  on  1^ 

(ii)  either  J g^  | >0  on  I.  or  g^^  = 0 on  I,. 


Proof.  See  Appendix  A.  □ 


For  a given  x £ P(fl  ) and  its  associated  partition  (I.),  let 

^ J 


and 


a = {i.-Xj^  =0  on  I } , 


Pj  = {i:xi  >0  on  1^) 


Let  t'  and  t"  denote  the  endpoints  of  I . Then  using  the  constraints 
J J J 


(l),  the  equation  for  x„  on  Ij 


becomes 


(8) 


B.  (t)  x (t)  + [ K.  (t,s)x.  (s)ds  = dJ(t) 
Pj  t;  Pj  Pj 

J 


for  all  t £ [t',t"),  where  dJ(t)  = b(t)  - / J K(t,s)x(s)ds 


y j 


^If  I = (t',t")  is  an  open  interval  then  I denotes  [t',t"), 


J 
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(9)  Proposition.  Let  x € P(fir)  and  let  { I ^ ) be  the  associated 

partition  of  [0,T]  given  by  Lemma  (7).  If  equation  (8)  has  a 

unique  right  analytic  solution  on  each  interval  I , then  x is  an 

J 

extreme  point  of  P(flr). 


Proof.  Suppose  that  x is  not  an  extreme  point.  Then  there  exist 

y,  z £ P(dr)  and  A £ (0,1)  such  that  x = Ay  + (l-A)z 

and  for  some  t £ fO,T),  x(t)  ^ y(t).  By  Lemma  A. 5 there  exist 

0 < r < s < T such  that  x = y on  [0,r)  and  x ^ y on  (r, s). 

By  Lemma  (7)  there  is  an  interval  I.  = (t'.,t'.')  of  the  partition 

J J J 

such  that  t'  < r < t".  By  analogy  with  the  sufficiency  argument 
J J 

presented  in  Rn,  it  is  clear  that  on  I.,  y satisfies 


B-B  (t)  yB  (*)  + / K.  (t,s)  y (s)ds  = e3(t) 
j i t'  pj  pj 

J 


for  all  t £ [t'.,t'.']  where 
J o 


i J 

e (t)  = b(t)  - / K(t,s)  y(s)ds 


However,  since  y = x on  [0,r),  it  follows  that  e'-’(t)  = dJ'(t) 

= 0 


on  I,.  Thus  y satisfies  (8)  on  I.,  Since  y 
J J “j 


a 


j 


on  I.,  y ’ ^ x on  I n (r, s)  / contradicting  the  uniqueness 
3 pj  3 

hypothesis.  □ 
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Remarks. 

(i)  In  sum  we  have  shown  that  being  able  to  solve  uniquely  for  the 
positive  components  locally  is  indeed  a sufficient  condition  for 
a right  analytic  solution  to  be  an  extreme  point.  The  important 
part  in  the  proof  played  by  the  right  analyticity  was  that  when 
given  x,  y 6 P(flr)  with  x / y,  we  could  find  an  earliest 
interval,  1^.,  in  the  partition,  on  which  x / y.  An  example 
where  we  cannot  find  a first  interval  is  the  following:  Let 
x(t)  = [t  sin(l/t)]+  (i.e.,  positive  part),  and  y(t)  = t on 
[0,1].  Clearly,  in  the  partition  of  [0,1]  induced  by  x, 
there  is  no  first  interval  on  which  x and  y differ.  Note 
further  that  the  theorem  is  false  if  the  solution  is  right 
analytic  but  we  choose  a partition  [I.]  that  does  not  satisfy 

[0,T)  = ll"  I..  This  is  illustrated  in  Example  B(2). 

J J 

(ii)  Example  B(l)  shows  that  local  uniqueness  is  not,  in  general, 
a necessary  condition  for  a solution  to  be  an  extreme  point. 


Vie  can  now  proceed  to  find  algebraic  conditions  on  the  coefficients 
that  ensure  unique  solutions  to  equations  of  the  type 


D(t)  x(t)  + [ L(t,s)  x(s)ds  = g(t),  t £ [ t ' , t " ) 

t' 


Since,  by  Example  B(l),  such  conditions  cannot  in  general  also  be 
necessary  conditions  for  uniqueness,  we  shall  confine  ourselves  to 
the  case  where  the  necessity  has  been  established.  This  is  the  time 
invariant  case,  i.e.  when  D and  L are  constants. 
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Before  doing  so,  we  remark  briefly  that  in  the  event 
D(t)  = I,  equation  (10)  is  a Volterra  equation  of  the  second  kind,  and 


it  is  well-known  that  such  equations  always  have  unique  solutions, 
provided  that  the  coefficients  g^(*)  and  (•,•)  are  in  L^. 

See  for  example  [25,  p.  10].  Thus  if  D ^(t)  exists  a.e.  and 
(D"1g)i,  (D-1L)  € L2,  we  also  obtain  uniqueness.  In  general,  however, 

D(t)  may  be  singular.  This  case  has  been  studied  by  Dolezal  [11], 
and  in  differential  equation  form  by  Silverman  [23].  However  their 
work  provides  only  a partial  answer,  and  a general  succinct  unique- 
ness condition  has,  to  our  knowledge,  yet  to  be  discovered. 

In  the  time  invariant  case,  equation  (10)  reads 


(11) 


t 

Dx(t)  + L / x(s)ds  = g(t),  t £ [ t • , t ")  . 

t' 


This  equation  has  been  thoroughly  studied  in  its  more  conventional 
differential  equation  form  by  a number  of  authors.  See  for  example  [5], 
[11],  [23],  and  also  Appendix  C.  The  uniqueness  condition  we 

require  is  the  following: 


(12)  Lemma.  If  the  components  of  g(*)  are  analytic  and  x satisfies 
(ll),  then  a necessary  and  sufficient  condition  for  x to  be  the 
unique  analytic  solution  is  that  there  exist  a scalar  ^ such  that 
uD  + L has  full  column  rank. 

Proof.  See  [53.  □ 


Remark.  This  full  rank  condition  has  several  interesting  interpre- 
tations. If  D and  L are  square,  then  + L has  full  rank  iff 

det(,iD  + L)  is  not  identically  zero  as  a function  of  n. 1 On  dividing 

hy  n and  setting  e = l/M  the  condition  reads:  D + eL  has  full 

column  rank  for  all  e sufficiently  small.  If  we  replace  (ll)  hy 

its  discrete  time  analog  using  time  intervals  of  stepsize  e,  we 
obtain  a block  lower  triangular  coefficient  matrix  with  each  diagonal 
block  being  D + eL.  Clearly  this  block  triangular  matrix  has  full 
column  rank  iff  D + eL  has  full  column  rank.  Another  interpretation 
can  be  made  by  taking  Laplace  transforms  on  both  sides  of  (ll)  with 

dummy  variable  The  coefficient  matrix  of  the  Laplace  transform 

1 2 

of  x so  obtained  is  precisely  D + — L. 

M 

In  order  to  show  that  this  full  rank  condition  is  also  a 
necessary  condition  for  a solution  to  be  an  extreme  point,  we  require 
the  following  lemma. 

(13)  Lemma.  If  there  is  no  scalar  ^ such  that  + L has  full 
column  rank,  then  given  any  x > 0,  there  exists  a nontrivial  analytic 
solution  to  the  homogeneous  equation 

t 

(14)  Dx(t)  + L / x(s)ds  =0,  t > 0 

0 

^Note  that  det(^D  + L)  is  a polynomial  in  4.  Hence  it  is  zero  either  for 
all  n or  for  at  most  finitely  many  ji. 

2 

See  also  Appendix  C. 


21 


satisfying 


f x(s)ds  = 0 . 
0 


Proof.  We  use  an  argument  similar  to  that  given  in  [ 5 , p.  4l8] . 

By  assumption,  for  each  p there  exists  a nonzero  (constant)  vector 

cp  such  that  (pD  + L)fp  = 0. 

M d 

If  D has  k columns  let  M > 2k  he  any  integer  and  let 
p^, ...,p„  be  any  distinct  scalars.  Let  G be  the  following 
2k  x M matrix; 


G 


cp  cp  . . . cp 

Ul 


M1T„ 


^1 


e cp 


pm 


Since  M > 2k  the  columns  of  G are  linearly  dependent.  Hence  there 
exists  a nonzero  vector  = (r|^, . . . such  that  Gt)  = 0. 

Set 

M Wit 

x(t)  = L Ti.p.e  cp 

i=l  1 1 ^i 


Then  it  is  easily  verified  that  x satisfies  (l4)  and  that 

T 

/ x(s)ds  = 0.  Moreover,  x is  not  identically  zero.  □ 

0 


We  can  now  obtain  the  main  result  of  this  chapter. 
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(15)  Theorem.  Characterization  of  right  analytic  extreme  points. 


Let  x be  right  analytic  and  satisfy 

t 

(l6)  Bx(t)  + K [ x(s) ds  = b(t),  x(t)  > 0,  t € [0,T], 

0 

where  B and  K are  constant.  Let  {I.}  be  the  associated  partition 
of  [0,T]  given  by  Lemma  (7),  and  define 

a.  = {i:x.  =0  on  I .) 

3 i 3 

p . = ( i : x . > 0 on  I.) 

3 i 3 

Then  a necessary  and  sufficient  condition  for  x tr>  be  an  extreme 
point  is  that  for  each  j,  there  exists  a scalar  ..  such  that 

H. B.p  + K.p  has  full  column  rank. 

J J 

Proof.  Sufficiency.  By  Lemma  (12),  x„  is  uniquely  determined  on 

: — 

I. .  By  Proposition  (9)  x is  an  extreme  point. 


Necessity.  Suppose  there  is  a j such  that  for  all  ^B.  + K. 

does  not  have  full  column  rank.  Since  xQ  (t)  >0  on  I.,  there 

is  a closed  interval  [u, v]  c I.  and  an  e > 0 such  that  x.(t)  > e 

J * 

for  t € fu,v]  and  i £ 0..  By  Lemma  (15)  there  exists  a nonzero 

J 

analytic  y„  (•)  satisfying 

ej 


- Vg'  f • 
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t e [u,v) 


■p.yp. 

j j 


(t)  + K. 


t 

/ yR 

u P1 


(s)ds  = 0, 


and 


v 

/ (s)ds 


0 . 


Rescale  so  that  | y . ("b)  | < e for  all  t £ [u,v)  and  i £ (3.. 

J 

By  construction  of  yD  in  Lemma  (13)  this  can  always  be  done. 

Set  ya  = 0 on  [u,v)  and  y = 0 on  [0,T]\  [u,v).  Then  by 

j 

construction  y satisfies 


and 


t 

By (t)  + K / y(s)ds  = 0 
0 


x(t)  + y(t)  > 0 
x(t)  - y(t)  > 0 


for  all  t £ [0,T],  Hence  both  x + y and  x - y satisfy  (l6).  ■ 
Since  y is  not  identically  zero,  it  follows  that  x is  not  an 
extreme  point.  □ 

2.3.  Definition  of  a basic  feasible  solution 

With  the  above  characterization  of  right  analytic  extreme 
points,  we  can  now  make  precise  the  notion  of  basic  feasible  solutions 
in  continuous  time.  This  will  be  fundamental  in  laying  the  ground 
work  that  will  be  used  in  the  remaining  chapters  to  develop  a con- 
tinuous time  simplex  method. 

2h 


* • -• 


From  now  on  we  shall  concern  ourselves  only  with  extreme  points 
having  finitely  many  breakpoints. 

(17)  Definition.  Let  x:[0,T]  ->  Rn  satisfy 
t 

Bx(t)  + K / x(s)ds  = b(t)  , x ( t ) >0,  t £ [0,T] 

0 

where  B,  K £ Rmxn.  Then  x will  be  called  a basic  feasible  solution 
if  there  are  finitely  many  points 


0 " *0  < *1  < 


< t = T 
k 


and  for  each  j = 1,  ...,k,  a partition  (a.,P.)  of  the  variables  such 

J J 

that 

(i)  |P,|  = « 

J 

(ii)  xa  (t)  = 0,  t € (t^^t  ) 

c) 

( iii)  det(|aB.  + K.  ) / 0 for  some 

If  in  addition  x0  >0  on  the  open  interval  (t . ,,t.) 

Pi  — J-l  J 

for  each  j,  and  x (t.)  >0,  x (t.)  >0  for  r c B.  D B.  , , 

j'  r y *0  Hj+l 

then  x will  be  called  non- degenerate.  On  the  interval  [t.  ,t.), 

J “ J 

x and  x will  be  called  basic  and  nonbasic  variables  respectively. 

aj 

Note  that  if  b(')  is  analytic  then  by  the  result  of 

Proposition  C.3(l8)  any  basic  feasible  solution  x will,  for  each 

interval  (t . , t.),  agree  with  some  function  yJ(*)  that  is  analytic 

J J 
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on  a neighborhood  of  In  particular  this  implies  that  both 

the  left  and  right  limits,  x^(tT)  and  x^(t+)  exist  for  all 
j = 1,  ...,k  and  all  i = 0,1,2, ...  . 


CHAPTER  3 


THE  SIMPLEX  METHOD  IN  Rn  AND  CONTINUOUS  TIME  VERSUS  DISCRETE  TIME 

The  purpose  of  this  chapter  is  to  motivate  the  approach  we 
shall  take  in  developing  the  theory  of  a simplex  method  in  continuous 
time.  In  Chapter  2 we  saw  that  it  was  possible  to  characterize  all 
right  analytic  extreme  points  as  ''basic  feasible  solutions"  in  a 
manner  very  much  akin  to  that  of  linear  programming.  Here  we  show 
that  the  naive  generalization  of  the  simplex  method  to  continuous  time 
is  inadequate  for  this  class  of  problems  and  that  additional  new 
techniques  are  required. 


3.1.  The  Simplex  Method  in  Rn 

We  first  briefly  describe  a single  iteration  in  the  simplex 
method.  While  this  is  well-known  [ 6 ],  we  present  it  to  illustrate 
and  motivate  the  approach  in  continuous  time. 

The  problem  is 

minimize:  cx 

subject  to;  Ax  = b,  x > 0,  x 2 Rn 
be  a given  feasible  solution.  Then  for  any  A 
cx  = cx  + A(b  - Ax)  . 


(1) 

Let  x 
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Upon  rearrangement  this  becomes 


(2)  cx  = (c  - AA)x  + Ab. 

To  obtain  an  algorithm,  the  following  condition  is 
required. 

(3)  Complementary  slackness  condition. 

x.  >0=>c  -AA..=0,  i = 1, . . . , n 

1 11 

The  following  simple  but  fundamental  lemma  follows  immediately 
by  inspection. 

(1+)  Lemma.  Let  x and  A be  given  as  above  and  suppose  that  the 

complementary  slackness  condition  (3)  holds.  Then  if  c - AA.  < 0, 

s s 

and  xg  (currently  at  zero)  can  be  increased  so  that  only  the 
positive  components  of  x need  be  adjusted  to  maintain  the  relations 
Ax  = b,  x > 0,  the  new  solution  obtained  in  this  way  will  have  a lower 
objective  value.  a 


The  simplex  method  uses  this  lemma  in  the  following  way. 
Begin  with  any  extreme  point  x,  and  let  p = (i:x^  > 0}. 
For  this  discussion  we  assume  "nondegeneracy,"  i.e.  the  matrix  A. 
is  square  and  nonsingular.  Proceed  now  as  follows: 


3 


Step  1.  Solve  for  the  prices.  Require  complementary  slackness,  i.e. 

cQ  - AA. _ = 0,  and  solve  uniquely  for  A. 

P P 


1 

c 


AA  is  called  the  reduced  cost  of  x. 
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Step  2.  "Pricing":  Select  £ such  that  c - Aa  <0.  If  no  such 

i/  S/j 

£ exists  we  are  optimal — stop. 


Step  3.  Represent  the  incoming  activity:  Solve  the  system  A.  y = a . 

PS  £' 

Step  4.  "Minimum  Ratio";  Increasing  to  some  level  0 yields 

a new  xQ  given  by 

P 

xs = XP  - V 

To  obtain  the  maximum  decrease  in  the  objective,  increase  0 

as  far  as  possible,  but  maintaining  x'  = xQ  - y 0 > 0. 

P P P — 

Under  nondegeneracy  precisely  one  component  of  x',  say 

P 

x^,  will  be  zero,  r is  determined  by 

( X'  ) 

r = argmin  | : yi  > 0 j 


and  the  maximum  0 is  given  by 


where  we  set  y = 0.  Define  the  new  basis  to  be 
~P 

P'  = P U [£)  ~ {r},  set  x^  = 0,  and  proceed  to  Step  1.  □ 


3.2.  Continuous  Time  Versus  Discrete  Time 

A priori  there  appears  little  reason  why,  without  any  further 
qualification,  we  cannot  apply  the  above  simplex  steps  to  the  continuous 
time  problem.  The  point  to  note,  however,  is  that  if  we  are  to  work 
with  solutions  having  activities  basic  over  intervals  of  time,  then 


in  order  to  get  a decrease  in  the  objective  value  by  a change  of  basis, 
this  change  will  have  to  occur  over  an  interval(s)  of  time.  The 
discrete  time  analog  of  the  above  process  would  correspond  to  a simultaneous 
exchange  of  many  activities  outside  the  basis  with  those  in  the  basis. 

This  differs  from  the  simplex  method  where  we  do  these  exchanges  one 
at  a time. 

In  this  case,  however,  Lemma  (4)  still  applies:  By  a simultaneous 
increase  of  several  activities  having  negative  reduced  costs,  we  can 
still  obtain  a strict  decrease  provided  we  only  adjust  existing  positive 
(basic)  activities  to  maintain  the  constraints.  What  goes  'wrong'  in 
continuous  time  is  that  without  some  basic  variables  becoming  negative, 
this  can  seldom  be  done.  In  order  to  make  a change  of  basis  over  an 
interval  of  time,  we  usually  in  addition  have  to  adjust  nonbasic 
activities,  which  may  well  have  positive  reduced  costs.  However,  once 
we  do  this  there  is  no  immediate  reason  why  the  intended  change  of 
basis  should  yield  an  improvement. 

Let  us  illustrate  with  an  example. 


(5)  Example. 

2 

minimize:  / { (t-2)x  + 2x  + tx  )dt 

0 . ^ 

t 

subject  to:  X^:  x^(t)  - / x2(s)ds  = 1-t 


X?i  x2(t)  + x^Ct)  = 2 
x^t)  >0,  t € (0,2]. 


W 
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We  shall  first  show  that  what  happens  in  continuous  time  and 
then  interpret  this  carefully  in  discrete  time. 

Begin  with  the  following  basic  solution: 

[0,1):  x (t)  = 1 - t,  x (t)  = 0,  x (t)  = 2 
(6)  1 25 

[1.2] :  xx(t)  = 0 , x2(t)  = x3(t)  = 1 

By  requiring  complementary  slackness  over  these  two  intervals, 
we  can  uniquely  solve'  for  the  prices  M-)>  and  obtain; 

t 

(1.2] :  ^(t)  = -1 

Ag(t)  = t 

cL(t)  = t-1  > 0 \ 

[0,1]:  \(t)  = t-2 
Ag(t)  = t 

c2(t)  = - | (t-1)2  < 0 

where  c^(*)  is  the  reduced  cost  of  the  nonbasic  variable  x^  on  the 
appropriate  interval. 

Examination  of  the  reduced  costs  shows  that  we  should  increase 
xrj  in  the  first  interval.  Suppose  we  do  this  by  exchanging  x0  and 
x,  on  the  interval  [0,e),  e >0.  This  will  yield  the  following 
solution  on  [0, 1) : 

^This  procedure  will  be  well  defined  later  on. 


31 


T 


[0,e):  x1(t)  = 1 + t,  x^(t)  = 2,  x^(t)  = 0 

[e,l):  xL(t)  = 1 + 2c  - t,  x2(t)  = 0,  x^(t)  = 2 


To  obtain  the  new  solution  on  [1,2],  we  would  like  to  maintain 
basic  the  same  variables  as  in  the  original  solution,  i.e.  x,  and 
Xy  Thus,  keeping  x^  = 0 over  [1,2],  we  require  x and  x,  to 
satisfy 


- / x0(s)ds  = 1 - t + 2e 
1 

x2(t)  + x^(t)=  2. 


t e [1,2] 


However,  the  only  solution  to  these  equations  is 


x2(t)  = 1 - 2e6(t-l) 
Xj(t)  = 1 + 2e6(t-l)  , 


t € [1,2] 


where  6(t)  is  the  delta  functional.1 

We  observe  immediately  that  x, ,(t)  = -°o  at  t = 1,  and  thus 
x there  fails  to  satisfy  the  nonnegativity  restriction.  Hence  it 
is  not  possible  to  adjust  only  the  basic  variables  and  remain  feasible. 

If  we  did  not  require  x^  = 0 over  [1,2],  and  instead  held 
it  fixed  at  Xj(t)  = 2c,  its  value  immediately  prior  to  t = 1,  we 
would  obtain  the  solution 


x1(t)  = 2c 

*2(t)  = 1 

Xj(t)  = 1 . 


t € [1,2] 


b may  be  thought  of  as  a "function"  that  is  zero  everywhere  except  at 
the  origin  where  it  is  so  large  that  it  integrates  to  1.  See  Appendix  C. 
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However,  this  solution  is  not  a basic  solution  and  moreover  has  an 
objective  value  that  is  greater  than  that  of  the  original  solution. 
This  is  because  the  reduced  cost  of  x^  over  [1,2]  is  positive, 
and  in  this  case  keeping  at  2e  over  [1,2]  yields  a positive 

contribution  to  the  objective  that  dominates  the  decrease  obtained 
over  [0,e).  See  Figure  1 below. 


The  approach  we  shall  pursue  will  be  to  allow  the  breakpoint  at 
t = 1 to  adjust  so  that  x^  can  be  allowed  to  reach  zero  continuously 
and  then  be  kept  at  zero  level  from  then  onwards.  This  will  yield  the 
following  solution: 


'W 


33 


10, e):  x1(t)  = 1 + t 

[e,  1 + 2e ) : x^t)  = 1 + 2t 
[l  + 2e,  2):  x^t)  = 0, 


x,  (t)  = 2,  x (t)  = 0 , 
t,  xy(t)  = 0,  Xj(t)  = 2 , 
xy(t)  = 1,  x^(t)  = 1 . 


/ \ 


\ old  xx 


N A— new  x., 
\ ^ 1 

\ 

\ 


1 + 2e 


FIGURE  2 


Here  we  have  preserved  on  t > e the  same  sequence  of  basic 
variables  as  before  and  have  only  altered  che  timing  of  the  change 
over  from  {x^,x^}  to  {x0,x^}.  Thus  we  end  up  with  a new  basic 
feasible  solution.  In  addition,  even  though  is  now  positive 

over  the  interval  [ 1,  1 + 2e)  where  it  has  a positive  reduced  cost 
it  turns  out  upon  recalculation  of  the  objective  value  that  an  over- 
all strict  decrease  is  achieved  for  c sufficiently  small.  Indeed, 


setting  c = l/3  yields  the  optimal  solution  (easily  verified  by 
recomputing  the  new  reduced  costs). 


If,  for  a given  e sufficiently  small,  we  consider  this  to  be 
a single  iteration,  then  it  is  certainly  not  a case  to  which  Lemma  (4) 
applies.  However,  as  the  analysis  of  the  discrete  case  will  show, 
we  can  think  of  this  as  an  infinite  number  of  basis  changes,  switching 
back  and  forth  between  t = 0 and  t = 1+,  and  accompanied  by 
repricing  an  infinite  number  of  times. 

To  discretize  the  problem,  we  divide  the  time  interval  [0,2] 
into  N intervals  of  equal  size,  and  approximate  the  integral  using 
the  leftmost  function  values.  Let  6 = 2/N.1  Then  the  discrete  formu- 
lation becomes 

N-l 

minimize:  Z {(kb -2)  x (kb)  + 2x  (kb)  + kbx, (kb) } 8 

k=0  123 

subject  to: 

r-1 

A, (r&):  x.(rb)  = Z x (kb)b  + (l  - rb) 
k=0  ^ 

Ag ( rb ) : x2(rb)  + x^(rb)  = 2 

x^(rb)>0,  r = 0, 1, . . . , N 

The  detached  coefficient  tableau  is  given  in  Table  1. 

The  initial  discrete  basis,  corresponding  to  the  initial 
continuous  basis  (6)  is  indicated  by  the  heavily  circled  elements 
in  the  tableau.  These  are  the  pivotal  elements.  Note  that  for 

^Not  to  be  confused  with  the  5 functional. 
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* *7;  ',v,  • 


t < 1,  and 


are  basic;  at  t = 1,  all  of  x^,  x0. 


are 


■ 


basic;  for  1 < t < 2,  x^  and  x,  are  basic,  and  at  t = 2,  only 
x,  is  basic.  Thus  in  the  discrete  case,  we  have  a surplus  basic 
variable  at  t = 1 and  are  one  short  at  t = 2. 

We  now  solve  backwards  in  time  for  the  prices.  As  in  the 
continuous  case  it  turns  out  that  x^(0)  has  a negative  reduced  cost. 
Increasing  x^(0)  has  the  effect  of  dropping  x^(l)  from  the  basis. 
This  exchange  is  indicated  in  Table  1 where  the  element  -5  with 


the  dotted  circle  is  not  the  new  pivotal  element  for  x,(0).  Note 
that  the  surplus  at  t = 1 has  now  shifted  to  t = 0. 


Table  1 


l(2-6)  l(2) 

c(2-8)  c(2 ) (RHB) 


*(0) 

(COST)  c(0) 


The  next  step  is  to  solve  again  for  the  prices.  However  in 
this  case  we  cannot  sequentially  solve  backwards  for  A(')>  since  the 
value  of  A-^(l  + 8)  can  only  be  determined  once  the  value  of  \^(0) 

has  been  found.  Solving  yields  A,  (l  + &)  - xr  - so  that  as 

1 do  d 

8 -»  0,  A^(l  + &)  ->  °o.  The  reduced  costs  of  the  nonbasic  variables 
are  all  of  order  unity  except  for  that  of  x^(l  + 8)  which  has  value 
8 - r;  - ■—•«  0.  Hence  we  now  introduce  x^(l  + 8)  into  the  basis. 
This  has  the  effect  of  dropping  x^(0),  and  so  transfers  the  basis 
surplus  from  t=0  to  t = 1 + 8.  See  Table  1. 

Below  we  sketch  the  effect  of  these  two  iterations  on  x^(*). 
Clearly,  the  simplex  method  in  discrete  time  is  behaving  in 
the  same  way  as  the  proposed  step  in  the  continuous  case  for  small  e. 

The  remainder  of  this  thesis  will  be  devoted  to  making 
rigorous  the  above  principle  in  continuous  time,  viz.:  if  an  activity 
prices  out  favorably,  then  without  further  pricing,  it  can  be  in- 
creased over  a sufficiently  small  interval  to  yield  a decrease  in  the 
objective  value. 


CHAPTER  4 


ON  A CONTINUOUS  TIME  SIMPLEX  METHOD 

In  this  chapter  we  develop  a theory  for  a continuous  time  simplex 
method.  First  we  consider  the  optimality  conditions  and  the  notion  of  a 
reduced  cost  in  continuous  time.  Then  we  show  how  to  move  f^om  one 
basic  feasible  solution  to  a nearby  one  with  a lower  objective  value. 


4. 1.  Optimality  Conditions  and  Weak  Duality 

As  in  the  simplex  method,  or  most  other  constrained  optimization 
problems,  the  first  step  is  to  modify  the  cost  functional  by  adding  to 
it  certain  linear  combinations  of  the  constraints. 

Recall  that  our  problem  is 


T 

minimize:  / c(t)  x(t)dt 

0 


(1) 


t 

subject  to:  Bx(t)  + K f x(sjds  = b(t) 

0 

x ( t ) >o,  t e [o,  t] 


where  we  assume  that  b(*)  and  c(’)  are  analytic,  and  B,  K 6 RmXn. 
The  following  Lemma  is  easily  obtained. 
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(2)  Lemma.  Let  x(-)  satisfy  (l),  and  let  0 = tQ < t^ < - . . < t^ = T 
be  any  points  such  that  the  derivatives  x^(t.-)  exist  for  all 

1 * m O 

i = 1, . . .,k  and  0 < j < £.  Let  A : [0,T]  ->  R be  any  function 
and  vJ1,  j = 0, 1, i = 1, ... , k be  any  vectors  in  Rm.  Then 


T 

(5)  / c (t ) x(t)dt 


T k Hj 

=/  A*(t)  b(t)dt  + E E b(j)(t.) 

0 i=l  j=0  1 

T k i 

+ / c(t)  x(t)dt  -EE  dJ1  x'J'(t7) 
0 i=l  j=0  1 


where  c(-)  and  {d*^}  are  defined  by 


T k 

(1+)  c(t)  = c(t)  - A*(t)B  - / A*(s)ds  K - E vQl  K 1 (t.  - 1) 

t i=l  1 


(5) 

and  we  define 


dJi  = vdiB 


VJ+1’ iK 


for  j > l. 


Proof.  By  assumption  we  may  differentiate  the  constraints  (l)  and 
evaluate  all  quantities  at  t = t”.  Thus 

■'"Some  k,  t, . 

2 * 

We  assume  both  x and  A are  integrable. 

5l+(t)  =0  if  t < 0 
=1  if  t > 0. 
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b^^(ti)  - Bx^(t")  - Kx^-1^(t“)  =0 
° < j < i,  1 < i < k 

where  we  define 

x^-1^(t)  = / x(s)ds  . 

0 

Now  left  multiply  the  (i,j)th  relation  by  and  add  it  to  the  cost 

-X- 

functional.  Also,  left  multiply  (l)  by  X (t),  integrate  over  [0,T], 
and  add  that  result  to  the  cost  functional.  This  yields 

T 

/ c(t)  x(t)dt 
0 

T T t 

= / c(t)x(t)dt  + / A*(t){b(t)  -Bx(t)  -K  / x(s)ds]dt 

0 0 0 

+ Z Z Vjl{b^(t.)  - Bx^(t")  - Kx^_1^(t7)} 

k=l  j=0  1 1 1 

Interchanging  the  order  of  integration  and  regrouping  terms  yields 
the  result.  □ 

We  remark  that  Lagrangian  constructions  of  this  type  con- 
sidered by  previous  authors  in  this  area  (see  Chapter  1)  have  been 
concerned  only  with  the  case  vJ1  = 0.  i >r  our  purposes,  however, 
allowing  them  to  be  nonzero  is  essential.  We  shall  later  see  that 
the  vJi  can  be  interpreted  as  the  coefficients  of  B's  and  their 
higher  order  derivatives,  . 1 

^See  Appendix  C for  definition  of  6^. 

to 


Observing  that  x(t)  >0  and  that  the  derivatives  x^(t^) 
are  unrestricted  in  sign  for  j > 1,  we  can  now  use  Lemma  (2)  to  define 
the  natural  dual  of  problem  (l),  and  make  a statement  about  weak 
duality. 

The  dual  problem  reads  as  follows: 

Find  time  points  0 =tQ<t  <>«•  < tfc=  T,  constant  vectors 
{vJi}  C Rm,  and  a function  A :[0,T]  -»  Rm  so  as  to 


k A . . / . . 

maximize:  / A (t)  b(t)  dt  + Z Z vJ1  b^  (t.) 

0 i=l  j=0  1 

T k 

subject  toi  A (t)B  + / A (s)ds  K + Z v0iKl  (t.-t)  < c(t) 

t i=l  1 


(6) 


V0lB  + v1’ LK  <0 
vjlB  + vJ+1,1K  = 0 


t £ [0,T],  j = 0, i = l,...,k. 


(7)  Lemma.  Weak  Duality. 

Let  the  triple  (A  , {v^},  { t ^ j ) be  any  feasible  solution 
to  the  dual  constraints  (6),  and  x any  feasible  solution  to  the 
primal  constraints  (l)  that  possesses  left  derivatives  x^(t~). 
Then 

T T k i 

(i)  / c(t)  x(t)dt  > / A (t)  b(t)dt  + Z Z b^(t.) 

0 0 i=l  j=0  1 


■ mam 
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(ii)  Equality  holds  in  (i)  if  and  only  if  the  following  conditions  hold. 


For  r = 1, ...,n  and  almost  all  t € [0,T] 

(a)  xr(t)  > 0 implies 

T k 

* (t)b  + f A*(s)dsk  + Z v0ik  1 (t  -t)  =c  (t) 
t i=l  1 r 

(b)  x^t^)  > 0 imPlies 

\P^b  + v1’  ^k  = 0 
r r 

(iii)  If  equality  holds  in  (i)  then  x is  an  optimal  solution  to  (l) 

* -j  j_ 

and  the  triple  (A  , {vJ  },  ft.))  is  an  optimal  solution  to  (6). 

Proof.  This  follows  immediately  from  Lemma  (2) . □ 

Conditions  (a)  and  (b)  are  complementary  slackness  conditions. 

. i j 

In  order  to  make  them  apply  to  any  triple  (A  , {v  },  {t^})  that 
need  not  be  dual  feasible1,  it  will  be  convenient  in  addition  to  make 
a statement  about  when  we  require  the  higher  order  terms  v^^b  + v1+1' ^k 
to  be  zero. 


T 


x and  (A  , {vJ  ),  {t^})  will  be  said  to  satisfy  the  complementary 
slackness  condition  if  (a)  and  (b)  of  Lemma  (7)  hold,  and  in  addition 
condition  (c)  below  holds: 

(c)  xr(-)  is  not  identically  zero  on  the  interval  (t^  - b,  t^) 

for  any  6 > 0 implies 

v^b  + vj+1’  1kr  = 0,  1 < j < i . □ 

Note  that  if  (A  , {v*^},  {t^})  is  dual  feasible  and  satisfies 
(a)  and  (b),  then  (c)  is  automatically  satisfied. 

Lemma  (7)  provides  us  with  a useful  means  of  verifying  that 
a candidate  solution  x is  an  optimal  solution.  We  simply  pick 
any  time  points  { t ^ , find  (if  possible)  a A (•)  and  {v^} 
that  satisfy  the  complementary  slackness  conditions,  and  then  check 
to  see  if  the  triple  (A  , (v^),  (t^))  is  dual  feasible.  If  so, 
x is  an  optimal  solution. 

In. order  to  exploit  this  fact,  it  will  be  important 
to  use  the  complementary  slackness  condition  to  define  the  dual  variables. 
As  in  the  simplex  method,  and  as  we  shall  show  here,  this  can  be  done 
when  x is  a basic  feasible  solution.  For  such  an  x,  we  use  the 

-M- 

{t^}  given  by  its  induced  partition  of  [0,T],  and  then  require  A 
and  { vJ1)  to  satisfy 


T k 

(9)  A*(t)B.  + / A*(s)ds  K.ft  + Z v°rK.„  = c (t) 

pi  t pi  r=i  pi  pi 


1B.  Q + vJ+1,1K.q  = 0 


3, 


t € (ti  tj, ) for  i = 1, . . . ,k. 
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Note  that  if  X and  (v  } satisfy  (9)  then  complementary 

i 1 

slackness  holds  between  x and  (X  , {vJ  },  {t^).  However,  the 
converse  is  not  true  in  general  since  even  a nondegenerate  x may  have 
xr(ti)  = 0 for  r £ (some  r,  i)  which  then  relaxes  the 
restriction 

v0ib  + vUk  = 0 
r r 


This  will  become  important  later  on  in  Section  X.5. 

Note  also  that  in  terms  of  c(*)  and  (d^1)  defined  in  (4) 
and  (5),  (9)  reads 


(t)  = 0, 


t € (ti-l’V 


= 0 , 


0 < j < a 


for  i = 1, . . . ,k. 

We  now  show  that  (9)  uniquely  determines  X and  {v^}, 

and  interpret  the  {v*^}  as  the  coefficients  of  the  6 functional 
and  its  higher  order  derivatives,  5^. 

For  the  following  lemma  we  shall  assume  that  the  reader  is 
familiar  with  Appendix  C which  contains  definitions  of  and  facts 
about  the  {6^}  and  more  general  distributions. 


(10)  Lemma.  Let  {t^  and  {f^},  i = 1,  ...,k  be  given,  and  assume 

that  for  each  i there  is  a scalar  M such  that  (pi3.A  + k*q  ) 1 

Pi  Pi 

exists.  Then  if  l > m, 


(9)  has  a unique  solution  (A  , { vJ  )),  which  moreover  necessarily 
satisfies  = 0 for  j > m. 

(A  , { v } ) is  the  unique  solution  to  (9)  if  and  only  if  A(*) 
defined  by 

A(t)  = A (t)  + Z Z vJ  6U'(t.  - t) 
i=l  j=0 

is  the  unique  distributional  solution  to 

T 

(11)  A(t)B  + / A(s)ds  K.  = c (t),  t £ (t  ,t  ),  i = l,...,k. 

pi  t pi  pi 

Proof.  Note  first  that  if  (A  , {v^})  is  a solution  to  (9)  then  A 
defined  as  above  is  a distributional  solution  to  (ll).  By  Proposition 
C.3  (l8)  and  the  assumption  that  (^B.^  + K.  ^ 1 exists  for  some 

it  follows  that  (ll)  has  a unique  distributional  solution  of  the  form 
given  above  for  A.  Further  if  any  such  A satisfies  (ll),  then 
equating  coefficients  of  8^  on  both  sides  shows  that  the  components 
A and  {v^}  of  A satisfy  (9).  □ 

With  the  aid  of  Lemmas  (7)  and  (10)  our  initial  step  in  a 

continuous  simplex  method  is  now  complete.  We  begin  with  a basic 

#■ 

feasible  solution,  x,  and  compute  the  unique  "prices"  A and 
{v^}  after  requiring  that  (9)  holds.  We  then  evaluate  the  reduced 


(i) 

(ii) 


If 


c(t) 

IV 

o 

jOi 

d 

< 0 

a-31 

= 0 

for  j = 1, m,  i = 1,  ...,k,  then  x is  an  optimal  solution.  If 

not,  we  use  the  fact  that  one  of  these  conditions  has  been  violated 
to  attempt  to  obtain  an  improved  solution.  This  will  be  the  subject  of 
the  remainder  of  this  chapter. 


Remark  on  the  Statement  of  the  Dual  Problem 

In  general  the  dual  problem  as  stated  in  (6)  will  not  always 

have  an  optimal  solution.  This  is  because  we  may  in  general  require 

► 

more  than  a finite  number  of  breakpoints,  and  thus  more  general  distri- 
butions than  firite  linear  combinations  of  functions  and  the  5^. 
However  it  seems  that  this  will  only  occur  when  the  primal  problem  (l) 
has  an  optimal  solution  x with  infinitely  many  discontinuities 
either  in  x or  in  x^. 

In  the  space  of  distributions  the  dual  problem  thus 


reads : 

T 

maximizei  / b(t)  A(t)  dt 
0 

T 

subject  to:  A(t)B  + / A(s)ds  K < c(t)  , t € [0,T] 

t 


for  A G D',  where  the  terms  with  / signs  have  their  usual  distributional 
meanings. 

h6 


Since  the  only  signed  distributions  are  measures,  any  feasible 


X will  have  to  be  such  that  ir(t),  defined  by 

T 

7r( t)  = A(t)B  + / X(s)ds  K, 
t 

is  a measure.  This  corresponds  precisely  to  the  requirement 

v^B  + v^+l,1K  = 0,  j > 1 

in  (6),  i.e.  the  "distributional  part"  of  tt  must  vanish. 

That  a strong  duality  theorem  always  holds  for  X € d*  is 
still  to  our  knowledge  an  open  question. 


4.2.  Improving  the  Solution  When  Some  Reduced  Cost  c^(t)  < 0 

We  are  now  in  a position  to  consider  moving  from  one  basic  feasible 

solution  to  a better  one.  Assume  tnat  we  currently  have  a basic  feasible 

solution  x,  and  that  we  have  computed  the  reduced  costs  c( • ) and 

{ d^ ) after  so  lving  (9).  The  case  we  consider  here  is  where 
_ * * 

c (t  ) < 0 for  some  l and  some  t 6 (t . , t.).  Since  c is 

l J“1  0 * 

continuous  on  any  such  interval,  there  will  be  an  interval  about 

* 

t on  which  c^  < 0.  Without  loss  of  generality,  we  shall  assume 

* 

that  t =0. 

The  following  major  assumptions  will  be  made  throughout  the 
remainder  of  this  chapter. 
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(I)  The  constraint  set  is  bounded  (in  L^). 

(II)  All  basic  solutions  encountered  are  nondegenerate, 
as  defined  in  Section  2.3. 

We  shall  now  attempt  to  introduce  x into  the  basic  set 

Xj 

over  [0,e),  some  sufficiently  small  e > 0,  in  such  a way  that  a 
decrease  in  the  objective  and  a new  basic  feasible  solution  are  achieved. 
This  will  be  done  in  two  stages.  The  first  will  be  to  make  a local 
basis  change  on  [0,e)  without  regard  to  what  happens  beyond  t = e. 

The  second  stage  will  consist  of  adjusting  the  solution  on  t > e 
(if  necessary  and/or  possible)  so  that  our  new  solution  satisfies  the 
constraints. 

It  will  be  important  to  remember  throughout  that  all  con- 
structions depend  on  e.  For  clarity  of  exposition,  this  dependence 
will  not  be  written  explicitly. 

4.2.1.  Making  a local  change  over  [0,e). 

On  the  interval  [O.t^),  the  first  constant  basis  interval, 
the  current  solution  x satisfies 

t 

(12)  B.  x (t)  + K.  / x (s)ds  = b(t) 

P1  P1  P1  0 P1 

- o 

If  we  are  to  increase  x.,  then  the  new  solution,  to  be  denoted  by 

Ju 

x',  will  have  to  satisfy 


(13)  B.  x'  (t)  + K / x'  (s)ds 

“l  “l  “i  0 P1 


= b(t)  - b^e(t)  - kf  / 0(s)ds 
where  9 denotes  the  level  of  x^. 

Since  x is  a basic  solution,  there  is  a ^ such  that 

(..B..  + K.  ) 1 exists.  Thus  to  solve  for  x'  in  terms  of  9, 

P1  P1  P1 
we  can  apply  the  result  of  Proposition  C.3(l8).  However  the  expres- 
sions become  cumbersome;  instead  we  rework  the  solution  directly  using 
the  techniques  of  Appendix  C. 

Taking  Laplace  transforms  on  both  sides  of  (12)  and  (13), 

and  multiplying  both  sides  by  (B.q  + — K.  ) 1 yields 

s P-^ 


xQ  (s)  = (B.q  + 7 K.  )-1  b(s) 


‘Pi  5 01 


and 


Va)  - <Vr%  >_1  - <B'e, 


Thus 


(lb) 


h(s)  6(s) 


where 


h(s)  = (B.a  + i K.a  )_1(bfl  + i kj  . 


■Pi  s 


A 

Now  h(s)  is  an  m-vector  of  rational  function  in  s,  each  having 

/\ 

the  same  denominator.  Using  a partial  fraction  expansion  of  h(s), 
we  obtain,  as  in  C.3(l4), 
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(15) 


1 


h(s)  = 


q 

L 

k=0 


k 

V + 


P 

z 

k=l 


V1 

z 

r=0 


kr  / ,r+l 

(=-tk) 


for  (uk)  C R®,  (vkr)  C af,  (|k)  C (E. 

The  following  Lemma  allows  us  to  assume  that  there  is  some 

Uj^  4 0,  0 < k < m. 


(l6).  Lemma.  Suppose  there  is  a subset  y of  the  rows  of  B and 
K such  that  K =0  and  every  column  of  B has  a nonzero 

r.  r. 

element.  Then  uQ  / 0 where  (uk)  is  given  by  (15). 


Proof.  From  (lit),  h satisfies 


(B-R  + T K-fi  ) h(s)  = + 7 k ) 

P-^  o p-j^  Hj  S X, 


Since 


the  equations 


A 


r 

= b 


read 

rn  ’ 


By  assumption,  b ^ 0.  Substituting  (15) 
T & 

equating  coefficients  on  both  sides  yields 
as  required.  □ 


into  this  relation  and 
ByPlu0  *°>  i*e*  U0 


In  our  case  we  have  made  the  assumption  (I)  that  the  constraint 
set  is  bounded.  Thus,  if  B and  K do  not  already  satisfy  the  con- 
dition of  Lemma  (l6)  we  can  either  add  sufficiently  large  simple  upper 
bounds  on  the  variables,  or  add  a single  row 
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ex(t)  + v(t)  = M 


where  v(t)  >0  is  a slack,  variable,  e is  a row  vector  of  ones, 

and  M is  a large  constant.  In  either  case  we  obtain  the  required 

form  for  B and  K.  Accordingly,  in  (15)  we  shall  let  q be  the 

largest  integer  such  that  u ^ 0,  where  now  0 < q < m. 

Q. 

Following  Section  C.3  we  can  take  inverse  Laplace  transforms 
and  use  (l^)  and  (15)  to  obtain 

(17)  x’  (t)  = x (t)  - *E  ( E a 0(k_r-l)(O+))  5(r)(t) 

P1  P1  r=0  k=r+l 

- Z u.  e^(t)  - / \|((t-s)  e(s)ds,  t > o 

k=0  0 


where 


p V1 

(18)  \jr(t)  = Z Z 

k=l  r=0 


6 k* 


kr  r! 


t > 0 . 


Expression  (17)  tells  us  precisely  how  to  go  about  choosing 
the  appropriate  9 over  the  interval  [0,e).  The  first  point  to  note 
is  that  the  boundedness  assumption  (I)  implies  that  all  solutions  are 

functions.  Accordingly,  our  choice  of  9 will  have  to  be  such  that 

(v)  + 

the  coefficients  of  {6  ')  in  (17)  vanish.  Thus  at  t = 0 ,9 

must  satisfy 

2 uVk"r“1)(0+)  = 0 , r = 0,1,  ...,q-l 

k=r+l 
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i.e. 


V(0V) 

Vi8<0>(0+)  + V(1><0+) 


= 0 


= 0 


u1e^°^(o+)  + •••  + u e^q-1^(o+)  = o . 


Since  u / 0,  this  implies  that 

4 


(19) 


0 (0+)  = 0(1)(O+)  = •••  = fl  (tl“1)  (n+\  ^ 


9 ^ "'(0  ) = 0 


Clearly,  if  q = 0,  i.e.  the  terms  in  5^  in  (17)  are  absent, 

then  these  restrictions  on  {0V  ' (0  ))  do  not  apply.  As  mentioned 

in  Appendix  C,  a sufficient  condition  for  this  to  occur  is  that 

B.^  exists. 

S1 

Next  we  consider  what  happens  at  t = € . Since  we  are  making 
0 basic  only  over  [0,€),  we  must  have  0(t)  identically  zero  for 
t in  an  interval  to  the  right  of  e.  This  implies  that  0^(t)  = 0 
for  all  such  t > e.  However  to  meet  this  condition  without  having 
to  introduce  S's  at  t = e,  the  term 

z u,  0(k) 


k=0 


in  (17)  implies  that  0^,  k = 0,1,  ...,q-l,  must  be  continuous  at 
t = e,  i.e.  we  require 
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(20) 


0(«")  = 0(1)(€_) 


e(q"1)(e")  = o 


Once  again,  if  q = 0 then  these  restrictions  do  not  apply. 

Since  c < 0 on  [0,e),  e sufficiently  small,  we  wish 
to  have  0 in  some  sense  as  large  as  possible  over  [0, e).  Clearly 
any  feasible  0 that  is  positive  over  (0,e)  and  satisfies  the 
restrictions  (19)  and  (20)  will  yield  a local  decrease  in  the  objective. 
However,  we  need  first  of  all  to  choose  0 such  that  the  resulting 
solution  is  again  a basic  solution.  We  shall  do  this  by  thinking  of 
0^  as  a bang-bang  control  variable  that  will  alternate  over 
successive  subintervals  of  [0,e)  between  positive  and  negative 
levels  that  are  as  large  as  possible. 

In  order,  secondly,  to  satisfy  the  restrictions  (20)  we  shall 

require  at  least  q switching  times  for  change  of  sign  of  0^  in 

the  interval  (0,e).  We  shall  here  make  use  of  exactly  q such  time 

points.  Denote  these  by  t^e,  T^e,  ...  , T^e,  0 < < * • • < < 1. 

We  now  proceed  as  follows.  At  t = 0,  (17 ) reads 

x'  (0)  = x (0)  - u 0(q)(O) 

°1  P1  q 

assuming  that  (19)  holds.  We  increase  0^(0)  as  far  as  possible, 
maintaining 

x_  (0)  - u 0^  (0)  > 0 . 

P]_  q _ 


Under  nondegeneracy  exactly  one  component  of  x'  (0),  say  x*  (0), 

P1  1 

will  become  zero.  This  is  exactly  the  same  as  the  minimum  ratio  test 
in  linear  programming,  r^  is  given  by 
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(21) 


) i 


rn  - argmin  . 

( 


ty±  > o. 


where  ya  = u , y =0.  Note  that  the  boundedness  condition  implies 
q a^ 

that  has  a positive  component  since  otherwise  we  could  increase 

0^,  and  hence  x*  (0),  indefinitely. 


now 


Let  a = (3  U {£}  ~ { r^}  be  the  new  basis  at  t = 0.  We 

wish  to  maintain  this  basis  change  over  the  interval  [0,-r^e),  assuming 

for  the  moment  that  is  given.  To  do  this  we  need  to  check  the 

following:  (i)  there  is  a n such  that  (|jB.  + K.  J”1  exists,  and 

°1  °1 

(ii)  x'  (t)  >0  for  t £ [0,t  e). 

o-i 


(22)  Lemma.  If  r is  determined  by  (21)  then  there  is  a M such 

that  (mB.  + K.  )_1  exists. 

°1  CT1 


Proof.  We  shall  use  Laplace  transforms.  Let  p be  such  that,  from 

(21), 


(23) 


u 


qp 


> 0 


where  u is  the  pth  component  of  u . Write 

qp  q 


(2h)  (B.^  i KVi)  =(B.#i4yM  <»,  * i k,)-(»r  ^ J 


This  holds  by  definition  of  o ...  By  factoring  out  the  matrix 

(B.  + — K.  ) on  the  right  hand  side,  and  noting  that  (b  + — k ) 
Px  s rl  s ri 

is  its  pth  column,  we  get 


(25) 


(B  + ~ K.  ) = (B  + i K )[I  + (h(s)  - e )eT]. 
CT^  S 01  P-,  S P1  p'  pJ 


Pi  s Y 


Now  the  matrix  I + (h(s)  - e Je1  is  the  identity  with  its  pth 

P P 

column  replaced  by  h(s).  Thus  it  will  be  nonsingular  iff  h^(s)  / 0. 
From  (15)  we  see  that 


h (s)  = u sq  + lower  order  terms 

p qp 


Since,  by  (25),  u t 0,  we  obtain  that  h (s)  ^ 0 for  s sufficiently 

u-i  ir 

large.  This  implies  that  for  s sufficiently  large,  the  left  hand 
side  of  (25)  is  the  product  of  two  nonsingular  matrices,  and  hence 
is  itself  nonsingular.  □ 


To  obtain  x'  (t)  we  can  either  solve  the  equation 


B.  x (t)  + K.  / x (s)ds  = b(t) 


°1  al 


al  0 al 


from  scratch,  or  equivalently,  use  (17)  to  compute  the  0(t)  that 

keeps  x'  (t)  =0  and  then,  again  by  (17),  obtain  the  remaining 
1 

components  of  x'  . The  latter  approach  yields  the  (q+l)th  order 
P1 

integro-.differential  equation 


(27)  u efq)(t)  + u 1 0 ' q_1^ ( t)  + •••  + u 0(t) 

qp  q-J-»  p u,  P 

+ / t n (t-s)  e(s)ds  = x (t) 

« P ^ 1 


with  boundary  conditions  (19). 


. 


(28)  Lemma.  For  e sufficiently  small,  x'  (t)  >0  on  [0,T.e), 

°1  1 

any  choice  of  t^:0  < < 1. 


Proof.  Let  y = <r^  ~ [£).  Under  nondegeneracy  x^,(0)  >0.  Since 

x'  is  analytic,  it  follows  that  x' (t)  >0  on  a neighborhood  of 

T" 

-L  / \ 

t = 0.  Since  0'q'(O)  > 0 and  0 satisfies  the  boundary  conditions 
(19),  it  follows  that  x' (t)  = 0(t)  > 0 for  t 6 (0, r e).  □ 


The  next  step  is  to  decrease  0^q^  as  far  as  possible  on 

trle’  t2£^  wllile  maintaining  x(t)  >0  on  [■ r^e ,T2e).  This  will 

cause  the  variable  x that  dropped  out  of  the  basis  on  [0, t e) 

rl  1 

to  become  basic  again,  and  another  variable,  x , to  drop  out  of 

r2 

the  basis  . r^  is  given  by 


(29) 


r_  = argmax 


( xi^ie_)  I 

’ — — = — -v  < n : 


yi 


yi  < 0 


where  y is  as  in  (21) . To  see  this  note  that  the  only  discon- 
tinuity on  the  right  hand  side  of  (17)  at  t = r^e  occurs  in  3^. 

Thus  we  may  write 

q-1  , Tle 

(30)  x'  (t]€+)  =x  (t.c)  - Z uk0^(T1£)-u0^qV1e+)'/  \|<(t-s)0(s)ds 
ei  ei  1 k=0  ^ q 1 0 

= x*  (t  e")  - u [0(q)(c.€+)  - 0(q)  ( r e-)] 
p-^  1 q l 1 

from  which  (29)  follows.  Note  that 
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(3D 


'r  (T1e+)  = -uqf){e^q^(T;L£+)  - o(q)(iie')} 


(q), 


.(q), 


Since  > 0 and  0vh'(t^£  ) > 0,  and  since  nondegeneracy  implies 

0^q^(t^£+)  < 0 for  sufficiently  small  e,  (31)  verifies  that 

x'  (t ,e+)  > 0. 
r,  1 ' 


Let  c0  = U {r^}  ~ { xr  ) . This  is  our  new  basis  on 

[x^e,  t?c).  That  we  indeed  have  a basic  feasible  solution  on  this 

interval  for  sufficiently  small  e and  t0  close  enough  to 

follows  from  an  analysis  similar  to  that  given  in  Lemmas  (22)  and 

(28).  However,  it  is  possible  that  0 may  go  negative  if  is 

too  large.  It  will  be  established  in  Corollary  ( k.2 ) below  that 
the  choice  of  { that  satisfies  (32)  below  ensures  that  9 
does  not  go  negative. 

We  now  continue  in  this  way  over  the  (q+l)  intervals 

[0,T^e),  [T^e,  x2e),  ...  , T^e,€)  alternating  the  sign  of  0 q^ 

and  obtaining  (q+l)  basic  sets  (cr.)  where  <x.  , ,=<r.  U fr. } ~ fr.  , , } , 

1 l+l  1 i l+l 

i = 1,  ...,<l-l.  Each  ri  is  determined  as  in  (29),  and  under  non- 
degeneracy, will  alternate  between  r^  and  r0  for  £ sufficiently 
small.  Also  it  is  easily  checked  that  for  all  {t^}  the  x!^  remain 
> 0 except  possibly  . x'  = 0. 

X 

What  remains  is  for  us  to  find  the  points  {t^}  and  show  that 
for  this  particular  choice,  0(t)  >0  on  (0,€).  Once  this  is  done 
we  shall  have  succeeded  in  making  x'  = 0 basic  over  [0,£),  and  in 

Xj 

so  doing,  only  have  adjusted  other  basic  variables  over  this  interval. 

In  doing  this  recall  that  we  have  not  been  concerned  with  the  possibility 
of  some  basic  variable  becoming  infeasible  at  some  t > c.  This  will 
be  the  subject  of  the  next  section. 
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In  principle  we  can  compute  (e^(e-)} 
parameters  { t ^ ) and  then  solve  for  the  (t^) 


as  functions  of  the 
with  the  equations 


®(c  I Cq)  — 0 

(32)  0(1)(€_|  rr...,Tq)  = 0 

0(q_1)(e'|  cq)  = 0 


In  practice,  some  iterative  technique  will  have  to  be  used.  Below  we 
show  how  a very  good  starting  solution  can  be  obtained. 

In  the  following  let 


(33) 


<xi(°) 

P-,  = min  < : y,  > 0 

1 / *i  1 


02  =-max 


••  yi  •=  °i 


where  y is  as  in  (21) . Under  the  boundedness  condition,  both 
and  p,  exist.  Under  nondegeneracy,  both  are  positive. 


(3*0  Theorem.  For  e sufficiently  small,  equations  (32)  have  a 
unique  solution  0 < < • • • < t <1.  Moreover,  the  are 

given  to  first  order  by 

'35)  Ti  “ 1 “ *!  + 0(c) 
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where  the  z.  solve 
1 


(36) 


zl  - z2  + 

2 2 

Z1  “ Z2  + 


>q+12 
%q+i  2 


+ (-Dq  \ - p 


+ (-D4  Zq  = P 


Z?  - Z|  + 


+ (-Dq+Vq  - p 


where 


1 > z.  > • • • > z >0 

i q 


p = p1/(p1  + p2)  • 

We  shall  prove  this  theorem  using  the  following  lemmas. 

(37)  Lemma.  Let  0 < < •••  < t <1  be  any  partition  of  [0,1], 

i q 

Let  the  bases  {o^}  be  determined  as  above.  Then  as  e -» 0 


e(q)(t)  = + 0(e) 


for  t 6 [0,  rl6),  [ ^e,  r e),  ...  , and 


e(q)(t)  = -p2  + 0(e) 


for  t£  f , Tge ) , ff-e.T^e),  ...  . 


Proof.  The  result  on  [0,  r^e)  follows  directly  from  the  analyticity 
of  9 on  this  interval  and  the  fact  that  0^(0)  = p,.  From  (30), 


-• 
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8(1>(r 


1<;+)  - max  j-i-ji + tjO  t y±  < oj 


>(q) 


Since  x'  and  0vy'  are  analytic  on  [0,  x^) 

x’C^e")  = x ' (0)  + 0(e) 
e(q)(TlO  = e(q)(0)  + 0(e) 


Also,  by  definition 


Thus 


X.'  (t, e“)  , \ 

-1  1 - 


x'(0)  = x(0)  - y 9^(0)  . 

xi(o) 


yi 


+ 0(e), 


from  which 


f 0 


0^q^(T1e+)  = - p2  + 0(e)  . 


Since  0^  is  analytic  on  [ t^,  t^),  the  result  holds  over  this 
interval. 

We  may  now  continue  in  this  way  and  thereby  establish  the 
lemma.  □ 

(38)  Lemma.  Under  the  conditions  of  Lemma  (37)  0 satisfies 


(39)  9(q-J>(t)  - p.  £ + 1;1  ' "k 

J • J • 


Z (-1)  (t— t e)J  + 0(e^+1) 
k=l  k 


for  t c x^e),  i = 1,  ...,q+l,  j = 0,1,  ...,q,  where  tq  = 0, 

rq+l  = L* 
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Proof.  We  shall  expand  9 in  power  series  that  are  centered  at 


and  hold  to  the  right  of  x^e.  We  proceed  by  induction  on  i. 

For  the  case  i = 1,  observe  the  boundary  conditions  (19),  and 

write 

e(t)  = e"q)(o)  tq/q'  + o(tq+1)  . 

Differentiating  yields 

0^q"^(t)  =■  e^(o)  rVj.  + o(tj+1) 

Now  0(q)(Q)  = p by  definition.  Since  t = 0(e)  for  t £ [0,e)> 
the  result  is  true. 

Suppose  it  to  be  true  for  i.  Write 


0(t)  = £ 0(r)(i  e+)(t  - t e)r/r'.  + 0(tq+1) 

r=0  1 1 


By  differentiating,  noting  that  t and  t - xje  are  0(e),  and 
using  the  result  of  Lemma  (37)  we  obtain 


8<q-J)(t)  - 9(«-Jtr)(t,£+)  (t  - ,l6)r/ri 

r=0  1 

+ r(t  - t^/ji  + 0(ej+1) 

where 


r 


i even 
-P2  i odd 


6l 


(k) 

Using  the  continuity  of  0'  ' at  t = t e,  k = 0,1, 
and  using  the  result  for  the  case  i,  this  becomes 


(39)  6(q'J')(t) 


_ 'v1  / ~ („  \ (j-r)  //,  ^ , . ^P1  + °2^  ^ , . sk  , . i 

r^|pi(Tie)  /(j"r)*  Td  - r): 


-r 


+ 0(e^_r+1)  ) (t-Tie)r/r.*  + yCt-^e )J/<j l + 0(eJ+L) 


Now  the  following  relations  hold: 

U)  £ (T.e)^"r^(t  - T e)r/(j-r):  r.’  = 4-  (tj  - (t  - -r.e)'3 ) 

r=0  1 1 J-  i ' 

(ii)  ^T,  7 (-l)k(r  e - t e)j-r(t-T  e)t/(j-r).'  r.’ 

r=0  k=l  ik  i 

* £ (-l)k  -7T  {(t-T  e)J  - (t-T  e)J] 

k=l  J.  k x 

(iii)  *Z  0(c'j_r+1)(t  - T,€)r/r’j  = 0(cd+1) 
r=0  1 


Substituting  these  into  (39)  yields 

_ ^1  r 4.J  t * _ _\Ji  . ^ P1  + P2 ^ iv1 


'(t)  = jr  {tJ  - (t  - t e)J)  + — 1 , 2 Z (-l)k((t  - t.€)J  - (t- x.e)J‘ 

o • -l  «j  • k.=X  ^ ^ 


+ fr  (t-  TiC)J  + 0(ed+1) 

(o,  + o ) i-1  k 

= °1  17  + jT-  k?i  (‘1) 

(Pl+  Pp^1"1)  k+1  i (r-o,)  h h+1 

+ JT1—  z (-1)  (t-T.e)13  + ■ ■■  : - (t  - x e ) J + 0(e J l) 

J*  U-l  1 J • 1 
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By  now  considering  the  cases  i even  and  1 odd,  the  appropriate  terms 
cancel  and  the  result  is  established  for  i+1. 

By  induction,  the  proof  is  complete.  □ 

In  addition  to  these  Lemmas  we  shall  require  the  results  of 
Appendix  D.  These  are  concerned  with  properties  of  the  system  of 
equations  (36). 


Proof  of  Theorem  (34) . Using  (39) > equations  (32)  become 


_ j ( P-l  pj  *1  V -! 

P-,  7T  + V,-2-  E ('1}  (£  - = °(€  } 

■L  J.  J.  j-=l  K 


for  j = 1, ...,q.  Thus  the  satisfy 


(40)  Z (-l)k+1(l  - t )J  = p + 0(e), 
k=l  K 


J — U, • • • , q 


Now  by  Theorem  D(2)  the  equations  (36)  have  a unique  solution 
1 > z^  > • • • > > 0.  Also  by  Corollary  D(9)  the  Jacobian  of  the 

system  is  nonsingular  for  such  {z^}.  We  can  therefore  invoke  the 
Implicit  Function  Theorem  [20,  p.  128]  and  conclude  that  (36)  has  a 
unique  solution  for  0(e)  changes  in  the  right-hand  side.  Moreover 
this  solution  will  not  differ  from  (z^)  by  more  than  0(e).  From 
this  it  follows  that  the  solution  to  (40)  is  given  by  (35),  and 
the  proof  is  complete.  □ 


> 
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Remark..  Note  that  since  the  t\  are  determined  within  0(c),  the 
actual  breakpoints  {eT^}  are  determined  within  0(e,J). 

We  now  establish  that  0(t)  >0  on  l0,e). 

(41)  Lemma.  Let  h:R  ->  R be  continuous  and  have  a derivative  that 
has  at  most  finitely  many  simple  discontinuities.  If  h(t')  = h(t")  = 0 
for  some  t'  < t" , and  h^  (t)  changes  sign1  k times  in  (t',t"), 
then  h(t)  changes  sign  at  most  k-1  times  in  (t',t"). 

Proof.  Suppose  that  h changes  sign  r times  at  the  points 
t'  < < •••  < wp  < t" . Then,  since  h(t' ) = h(w^)  =•••=  h(wr) 

= h(t")  = 0,  h^(t)  changes  sign  at  least  once  in  each  interval 
(wi,wi+l)  where  wQ  = t',  w^^  = t" . Thus  h^(t)  changes  sign  at 
least  r+1  times  in  (t',t").  Hence  r+1  < k,  or  r < k-1  as  required.  □ 

(42)  Corollary.  0(t)  >0  on  [0,e). 

Proof.  Since  0^(0)  = 0^(e)  = 0,  i = 0,1,  ...,q-l  and  since  0'^(t) 
change  sign  q times  in  this  interval,  we  can  apply  the  Lemma  repeatedly 
to  obtain  that  0 has  no  change  of  sign  in  [0,e).  Since  0^(0)  =0 
for  i = 0,1,  ...,q-l,  and  since  0^(0)  > 0,  the  analyticity  of  0 
ensures  that  0(t)  >0  in  some  neighborhood  (0,S).  Thus  0(t)  >0 
on  f 0, e) . □ 

By  a "change  of  sign,"  we  mean  going  from  positive  to  negative  or 
vice  versa. 
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4 


FIGURE  U 


4.2.2.  Adjusting  the  basis  on  t > e 

So  far  we  have  obtained  an  x'  from  x that  satisfies 

t 

Bx' (t)  + K / x' (s)ds  = b(t) 

0 

x' (t)  > 0 

for  t c [0,e).  We  now  wish  to  determine  the  remainder  of  x'  that 
satisfies  these  relations  on  [e,T],  There  is  in  general  no  unique 
way  of  doing  this,  and  indeed,  it  may  not  be  possible  at  all.  When 
no  adjustment  to  the  right  of  e is  possible,  then  neither  is  the 
basis  change  over  [0, e),  and  we  shall  then  have  to  pick  some  other 
interval  of  perturbation.  This  aspect  will  be  considered  in  Section  4.5. 

Our  approach  here  will  be  to  find  an  x'  over  [e,T]  that  is 
close  to  x in  some  sense,  and  that  can  be  obtained  in  as  natural  a 
way  as  possible.  What  we  shall  do  is  try  to  preserve  on  [e,T]  the 
same  sequence  of  basis  changes  as  in  x,  and  only  adjust  the  timing 
of  these  changes.  This  is  the  approach  we  took  in  Example  (5)  of 
Section  5.2. 

In  addition  to  assumptions  (i)  and  (II)  of  the  previous  section, 
we  shall  now  assume  the  following: 

(III)  All  basic  solutions  are  such  that  consecutive  bases 
differ  in  only  one  element. 

This  assumption  is  a kind  of  nondegeneracy  assumption:  if  (3^  and 
Bi+1  differ  in  more  than  one  element,  then  we  can  insert  bases  of 
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zero  time  duration  between  them  to  achieve  any  two  consecutive  ones 
differing  in  only  one  element.  The  assumption  is  then  that  all  time 
intervals  are  of  positive  duration. 

In  order  to  facilitate  the  presentation  and  understanding  of 
the  material  in  this  section,  we  shall  need  to  develop  additional 
notation. 

The  superscript  will  be  used  to  denote  an  adjusted 

variable  or  parameter.  Thus  x' , {t^},...  denote  the  adjusted  x, 

adjusted  breakpoints, ....  We  remind  the  reader  that  all  adjusted 

variables  or  parameters  will  depend  on  and  that  this  dependence 

will  be  assumed  understood  throughout. 

and  r^  will  denote  respectively  the  entering  and  leaving 

variables  at  t..  Thus  = P.  U [&,}  ~ {r.}. 

1 i+1  11  l 

It  will  be  useful  to  work  with  variables  w^(*)  that  coincide 

with  xR  (•)  on  [t^  ^,t^)  but  that  continue  beyond  t^  as  if  the 
pi 

breakpoints  t^,  t^+^,  ...  did  not  exist.  Thus,  given  x on  [Ojt^  ^), 
w^  is  defined  by 


V JL  — J- 

B.Rwi(t)  + K.  / w.  (s)ds  = b(t)  - K / x(s)ds,  t > t 
Pi  pi  t.  , 0 


In  addition  let  |^(*)  and  t^(-)  be  respectively  those 

components  of  w^  and  w^+^  that  correspond  to  x^  and  x , 

i.e.  xr  (t)  = § ± (t ) on  t±)  and  x£  (t)  = n^^Ct)  on  [ t±, ti+1) 

As  in  the  derivation  of  (17)  we  can  represent  x0  on 

Pi 

[ti,ti+i)  in  terms  of  as  follows: 
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m.-l  m 


m 


xR  (t)  = w.  (t)  - Z ( Z \ +))  &(d)(t) 


j=0  k=j+l 


m.  t 

- ZV  ,W(t)  - / ♦.(t-B)  T,  (B)dS 

k=0  1 ti  1 


t € f'ti,ti+1) 

for  some  nu  >0,  (u£)  and  * ) where  these  are  found  similarly 
to  their  counterparts  in  (17).  Here  they  are  determined  from  the 
partial  fraction  expansion  of 


(B-e.+  s lt-e.)"1  (b<  + = “<  ) • 


«i  = *t 


As  was  done  for  (17)>  we  can  assume  u^  ^ 0. 

i 

Of  considerable  importance  will  be  the  behavior  of  x and 

i 

x as  they  approach  t . . Let  p . > 0 be  such  that 

Sj  . ^ 


(45) 


ip}(t.)  = 0, 


(pj 

lt  1 j 0 


o < j < p. 


and  q.^  > 0 be  such  that 

T)(j)(t+)=0,  0 < i < q. 

(46)  \ )-k  . 

T)i  (tjj  f 0 . 

The  following  Lemma  relates  and  on  t > t^. 
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(47)  Lemma.  For  each  i > 1 
(i)  mi  < qt  < mi  + p.^ 

i 

(ii)  There  are  (vk)k=:0  C R,  ^ 0,  and  cp± ( * ) such  that 
• "t  (p  ) 

(48)  Z vj  n[k)(t)  + / cp  (t-s)  n (s)ds  = | ?1  (t),  t>t.. 


Proof.  Since  the  lhs  of  (44)  contains  no  terms  in  6 J the  coefficients 

of  8^  on  the  rhs  of  (44)  must  vanish.  Since  u*  / 0 this 

“i 

implies,  as  in  the  derivation  of  (19),  that 


n(d)(t*)  = 0 , 


0 < j ■<  m 


By  definition  of  this  implies  qi  > rru . 

Since  xr  (t)  =0,  t£  (44)  yields 

mi  t 

0 = i^t)  - Z u£  T^(t)  - / \|r  (t-s)  qi(s)ds 


where  j is  such  that  r^  is  the  jth  component  of  0 . Differentiating 
the  right  hand  side  p.^  times  and  noting  (45)  yields 


mi+Pi 


• * i (*)  t (PJ  (P, ) 

■is  VPi,J  ^i  (t)  + l +ij  (t-sJ-^^BjdS  = li  ' 


y 4 (k) /.+. 

.LVr.j'l  <V 


0 < r < p. 


(50) 


VP1  . 


_ . (Pi’ . 


L Vp„i  ii  <v  -«  (V  ^ 


where 


*5'1)<o>  • 


1 < k < p. 
— — 


From  (50)  there  is  some  k such  that  / 0.  Let 

k-p^, J 

q^  be  the  largest  such  k.  We  show  that  q = q^.  Clearly, 
nu  + p^  > q^  > q^,  by  (46).  Suppose  that  q > q. . Setting 
r = q.  - m.,  . p -1  in  (49)  and  noting  (46) 


u ^ . = 0 

qi-Pi+l> 0 


But  u /0  and  nn  > q^  - p.  > q.  - p + 1 yielding  a contradiction. 

1 1 1 

1 i i , (Pl) 

Setting  v = VL  .,  0 < k < q.  and  qp  (t)  = i|r  (t)  completes  the 
K -K-p^jJ  1 1 1J 

proof.  □ 


Below  we  sketch  the  cases  in  which  p.  and  q.  are  one  of 

i l 

0,  1,  or  > 2.  Any  of  the  pairs  (p^q.)  is  possible. 


Remark.  Note  that  p^  and  q^  are  dependent  on  t^,  or  rather,  the 
behavior  of  ^ and  at  t^.  However  m1  and  the  relation  q^m 
do  not  depend  on  t^.  rm  is  determined  solely  by  the  coefficients 

B-ei’  \ “d  \ 

We  are  now  in  a position  to  consider  the  adjustments  of  the 
t^  in  order  to  obtain  the  desired  x . The  construction  will  be 
done  inductively  on  i. 

For  i = 0,  the  adjustment  has  already  been  made:  t’  = e. 

Analogously  to  (43),  given  adjusted  time  points  t^,  t^,...,t.'  , 
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and  x'  defined  on  by  the  bases  o on  [ -r^,  e,  T;.e) 

j = l,...,q+l,  and  p.  on  [t| ' ,t»),  j = we  can  define 

J 0 J 

w’  by 

t *1-1 

(51)  B.~  w?(t)  + K.  f w! ( s ) ds  = b(t)  - K / x'(s)ds,  t > t»  , 

Pi  1 Pi  t!  . 1 0 1-1 

l-l 

In  attempting  to  find  t!  we  shall  try  to  satisfy  the  follow- 
ing: 

(i)  t!  -*  t.  as  e -»  0 . 

1 1 

(ii)  w[+^  may  not  contain  6*  s. 

(iii)  On  neighborhoods  to  the  left  and  right  of  t!,  we  must  have 

w^(t)  > 0 and  w’+1(t)  > 0 respectively. 

(iv)  w'  must  be  ’close’  to  w.,n  in  order  that  future  breakpoints 
' l+l  1+1 

can  be  adjusted. 

We  begin  by  considering  w|(t)  - w^(t)  for  t > t^.  From 
(17)  and  the  construction  of  0,  it  is  clear  that 

e 

(52)  w’ (t)  - w,  (t)  = / \j((t-s)  e(s)ds  . 

0 

The  following  lemma  gives  an  important  estimate. 

(53)  Lemma.  There  is  an  M > 0,  independent  of  e,  such  that 

f 9( s)ds  = Meq+1  + 0(eq+t ) 

0 

where  q is  as  in  the  previous  section. 
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Proof.  From  (39)  we  have  that 

4-  Q ( P-i  Pp)  k-  n n+1 

9(t)  = P-^  + ^7 — “ ^ (_1)  (t  - Tte)q  + 0(eq  ) 


k=l 


for  t£  [xi  .e,  -r^e).  Integrating  over  (0,g)  and  simplifying  yields 


/ «(t)dt  = (Pl*02)  (-D^dTtp^l^r 


+ o(ei+2) 


By  Lemma  D(3)  and  Theorem  (34)  the  term  in  { } is  equal  to 


P-i  <1 

T— - n.  U - O +o(6) 


°1  p2  k=l 


where  the  z satisfy  (45)  and  are  independent  of  g.  Thus 

K 


e ^ q+1  n+P 

f «(t)dt  = PL  ^ (l  - zk)  + 0(61  ) 


Since  1 > z,  > • • • > z >0  and  pn  > 0,  setting 
i q 1 


m * P,  n (i  - z, ) / ( q + i): 

1 k=i  k 


gives  the  desired  result.  □ 


(54)  Corollary.  Let  g:[0,»)  -»  R be  such  that  for  all  t £ [0,g) 


g(t)  = g(0)  + 0(g)  . 


Then  there  is  an  M > 0 such  that 


/ g(s)  e(s)ds  = g(0)  Meq+1  + 0(eq+2)  . 

0 

Proof.  This  follows  immediately  from  the  Lemma.  □ 

We  apply  this  corollary  to  (52).  Since  w^,  w1  and  are 
analytic,  differentiating  (52)  yields 

(w'  - w )^(t)  = / \|r^  (t-s)  S(s)ds,  j >0  . 

0 

By  Corollary  (5b)  we  obtain,  for  i = 1, 

(55)  (wi  - w.)(j)(t)  = 0(eq+1),  j > 0 . 

Thus  not  only  are  and  w|  close  together  but  so  too  are  all  their 

higher  order  derivatives.  It  will  be  important  that  the  pairs  w^,  w;, 
inherit  this  property  as  well  as  the  same  order  of  magnitude  difference 
eq+1. 

Let  us  now  make  the  induction  hypotheses  that 

t£_^  tp  p as  e -»  0 and  that  w^  and  w!  satisfy  (55)* 

Under  nondegeneracy,  we  have  that  w. (t)  > 0 for  t € (t  ,,t 

i l-l  i‘ 

Preassign  any  small  5 > 0.  Then  by  this  remark  and  the  induction 
hypotheses  there  is  a r > 0 such  that  for  all  e sufficiently  small, 

(56)  i[(t)  > r for  t £ tti_I  + 6,  t - 6]  , 

for  t £ [ti  + 5,  ti  + 25]  , 

7b 


i > 1 


(57) 


|q(t)|  > r 


and  all  other  components  of  wd  are  > y for  t £ [ ^ + 8,  t^  + 28] . 
Relation  (57)  follows  from  the  fact  that  since  is  analytic,  it 

has  at  most  one  zero  in  ft.  - 5,  t.  +8],  and  is  nonzero  on 
[t.  + 6,  t.  + 28]. 

In  Figure  6 below  we  illustrate  the  possible  cases.  The 

dotted  curves  below  are  l^(*)  and  the  solid  curves  are 

It  is  clear  from  these  figures  that  a good  candidate  for  t!^ 

is  either  t!  = t.  if  5!  has  no  zero  in  [t  - 8,  t + 8],  or  t! 

iii  ii  i 

equal  to  the  leftmost  zero  of  in  [t^  - 5,  t^  + 5],  We  shall 

show  that  this  is  indeed  the  case  whenever  adjustment  of  t ^ is 
possible.  What  determines  this  is  whether  or  not  and  | ^ can  be 

related  analogously  to  (US). 

The  following  is  the  main  result. 

(58)  Proposition.  Adjusting  t^. 

Assume  the  induction  hypotheses  that  t£_^  -»  t^_^  that 

w!^  and  w^  satisfy  (55). 

From  the  above  discussion  we  have  that  for  all  e sufficiently 
small,  the  number  of  zeros  of  in  [t  - 8,  t^  + 8]  is  constant. 

Determine  t^  as  follows:  if  there  are  no  zeros  in  this  interval, 
let  t^  = t^  else  let  t^  be  the  leftmost  such  zero.  For  this  t| 
define  w^+1  as  in  (51).  In  addition  let  p^  > 0 be  such  that 
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t'(j)(tp  = 0 , 0 < J < p- 

(p[) 

q (t* ) t o . 


Then,  if  = p^,  the  following  hold: 


(i) 

t!  - t.  = 0(6q+1)  if 

P,  > o 

i 

(ii) 

(w'  - w )(J)(t)  = 

V l+l  i+l;  1 ; 

0(eq+1)  for  all  t > max{q,q) 

and  j > 0 

(iii)  wi+1(t)  >0  for  t £ [q,  ti+1  - 6]. 


Proof. 

(i)  Write  |!(t)  = £ . (t)  + y(t).  Differentiating  this  p.-l 

1 (Pi-1)  1 

times  and  expanding  q about  q yields 


(P.-l)  (P.-l)  (P.)  p 

q (t)  = y (t)  + q (q)(t-q)  + o((t-t.)2) 


Now  from  (55),  (56),  and  (57)  it  follows  chat  t!  -»t.  as 

(Pi“l)  (Pt)  1 1 

e -» 0.  Further  q (q)  = 0 and  q (t  ) is  independent 

of  e . Thus 
(P,“l) 

y (q)  (p.) 

(q  q)  “*“6i  (ti)  * 0 as  €_*°- 

(pi-L)  a+x 

Since,  by  (55),  y (t«)  = 0(eq  L)  , 


q - q = 0(eq+^)  as  required. 


(ii)  Since  p!^  = p^,  it  is  clear  from  Lemma  (I47)  that  | ^ and  rj! 
satisfy 


(59) 


(60) 


qi 

Z 

k=0 

and 


qi  , * (pj 

Zv7  T].'  (t)  + / Cp,  (t-s)  T)'  (s)ds  =f'  1 (t)  , 

-n  K 1 1 1 


t! 

1 


t > t! 
— 1 


,'(k)(tp  =0,  0 < k < q. 


K^t!)  f 0 . 


We  wish  to  solve  for  n.  and  n!  in  terras  of  f . and 

'1  1 

5:,  respectively.  Since  the  equations  ( hQ ) and  (59)  have  the 
same  coefficients,  and  since  both  t) ^ and  q have  their 


first  q^-1  derivatives  initially  zero,  it  is  easily  seen 


(by  using  Laplace  transforms,  for  example)  that 


„ (P,)  t ^ (p  ) 

ni(t)  = vili  (t)  + / cp.(t-s)  q (s)ds 


and 


l((t)  = v.q^i^t)  + / q(t-s)  1 (s) 


ds 


t! 

1 


for  some  and  Differentiating  these  yields  for  j > 


( j) , 


(61)  (r,*  - q)VJ,(t) 


^(q-q) 


(Pj+j-l-k) 

(t)  + Z qV*'(0)(q  .)  (t) 

k=0 


(P,+J)  j"1 

1 (t)+  Z 


^ ~( i)  (Pi) 

+ / T)j  ; (t-s)(q  - q)  (s)ds  + Qi(t) 


t* 

1 


= 0 iff  qi  > 0. 
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(62) 


for  all  t > t*.  where  t*  = maxft'  t.)  and 

— l 1 l l J 


t! 


1 (p, ) 

)=-/  <P£j;(t-s)  (s)ds  if  t£  > t± 

t . 


and 


(p«) 

fi.(t)  = ; cp)j;(t-s)  | ! (s)ds  if  t!  < t.  . 
1 ^ 1 *1  1—1 

i 


Since  - ti  = 0(eq+1)  and  <p^(*),  ^ 1 (')  and  l'  1 (•) 
are  bounded,  we  have 


'(j), 


(P«> 


(pJ 


Mt)  = 0(eq+1) 


By  (55)  we  also  have  that  all  the  remaining  terms  of  (6l)  are 
0(eq+1).  Hence 


(ni  - t^) (j) (t)  = 0(eq+1),  J > 0,  t > t*  . 

Now  since  the  components  of  w^+^,  except  r^,  are  all 
components  of  x^  for  t £ [t^  t.^  ),  we  see  from  (hh)  that 

for  t > t*, 

— i 

(w[+l  ~ wi+1)fc  (t)  = (w!  - w^^t)  + finitely  many  terms  in 

(t]'  - Tii)(d)(t) 

where  k runs  over  all  components  of  w^+^  except  for  t}^. 

By  (62)  and  (55)  this  implies  that 
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<"i+i  - Vi>(d>(t>  ■0<sqtl) 

for  t > tt,  j > 0,  as  required  for  (ii). 

(iii)  This  follows  immediately  from  (i)  and  (ii)  and  since 

(O 

ti.  (t.)  > 0.  □ 

Since  the  induction  hypotheses  of  the  Proposition  were  shown 
to  he  true  for  i = 1 we  can  now  inductively  perturb  the  breakpoints 
t^  provided  at  each  step  we  have  p^  = p^.  When  p^  = 0 or  = 1, 
it  can  be  easily  seen  that  this  is  always  the  case.  However,  when 
p^  > 2,  we  only  obtain  p!^  = p in  the  cases  (iii)  and  (v)  of  Figure  5. 

The  remaining  cases  in  Figure  5 for  p^  > 2 all  have  p^  < p^.  When 
this  occurs,  statements  (i),  (ii)  and  (iii)  of  the  Proposition  need 
no  longer  hold.  This  may  result  in  infeasibilities  or  in  very  large 
perturbations  further  on.  As  we  shall  see  in  the  next  section,  state- 
ments, (i)  and  (ii)  are  crucial  in  establishing  improvement  in  the 
objective. 

The  only  breakpoint  requiring  special  attention  is  the  endpoint 
t.  = T.  If  the  effect  of  9 is  to  move  t£  to  the  right,  then  we 
simply  truncate  the  new  solution  at  T.  However,  if  t£  moves  to  the 
left,  then  the  attempted  basis  change  will  only  be  possible  if  there 
is  some  feasible  extension  of  x beyond  T.  As  will  be  indicated 
in  the  next  section,  even  when  we  can  move  t£  to  the  left,  we  may 
not  be  able  to  obtain  a decrease  in  the  objective  value. 
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In  the  event  that  we  get  ’stuck'  at  the  endpoint  or  at  some 
t^  because  p!^  < p^,  it  may  be  possible  to  take  advantage  of  an 
ambiguity  in  the  dual  prices.  See  Section  4.5 . 


4.2.3.  Proof  that  x'  is  an  improved  solution. 

Assuming  that  the  adjustments  of  the  breakpoints  on  t > e 
were  successful1  we  now  have  a new  basic  feasible  solution,  x'. 


specified  by  the  bases  on  [t^  x.e)>  and  0^  on  [t^  ,t£). 

To  show  that 


T T 

/ c(t)  x'(t)dt  < / c(t)  x(t)dt 
0 0 


we  shall  show  that  the  decrease  obtained  over  [0,e)  dominates  any 
possible  increase  due  to  the  shifting  of  the  breakpoints.  Of  course, 
as  discussed  in  Section  3.2,  if  none  of  the  breakpoints  have  shifted 
the  result  is  immediate. 

The  case  where  < T is  special  and  will  be  considered 
separately  at  the  end  of  this  section. 

The  change  in  the  objective  may  be  found  using  (3)  and  the  fact 
that  complementary  slackness,  (8),  holds  for  the  pair  (x,A).  The 
expression  so  obtained  is 

T T 

(63)  / c(t)  (x' (t)  - x(t)  )dt  = / c(t)  x’(t)dt-I  I dji  x’(j)(t7)  . 

0 0 i j 

1i.e.  p^  = Pj,  all  i,  and  t^=  T,  where  t^  = T. 
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We  need  to  show  that  the  right-hand  side  is  negative  for  e sufficiently 
small.  Define 

e 

h (e)  = / c(t)  x' (t)dt 
0 

T 

hg(e)  = / c(t)  x’ (t)dt  - Z Z dJi  x' ^ (t7)  . 
e ij 


(64)  Lemma.  There  is  an  M < 0 such  that 
h^(e)  = Me^+1  + 0(e^+2) 


Proof.  By  complementary  slackness 

€ 

h (e)  = / c (t)  0(t)dt  . 

1 0 1 

Since  c^(*)  is  analytic  on  a neighborhood  of  t = 0,  and  c^(0)  < 0, 
we  can  apply  Corollary  (54)  and  obtain  the  result. 


We  next  show  that  h2(e)  = 0(eq+2) . By  complementary  slackness 
and  Proposition  (17)>  we  can  break  down  this  expression  into  contri- 
butions due  to  tj  moving  left  or  right.  We  obtain 


(65)  h2(e) 


z 

such  that 
< t. 


t. 

(/  Cg  (t)  n[(t)dt  - Z d^1 

i d i 


+ Z / C (t)  | ! (t)dt  . 

i such  that  t.  i 

4.  ^ 4.  f ^ 


(t.)) 
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Note  that  the  terms  in  t!^  = t^  make  no  contribution.  Thus  for  all  i 
appearing  in  (65),  we  have,  from  the  construction  of  the  t!,  that 
Pi  > o. 

(66)  Lemma.  For  all  i such  that  t^  < V, 

t! 

/ Z (t)  ?'(t)dt  = 0(eq+^)  . 

t.  i 
1 

Proof.  |£(t)  = (|^(t)  - ^(t))  + ^(t).  By  Proposition  (58) 

|!(t)  - |.(t)  = 0(eq+1)  . 

Expanding  | ^ about  t^  yields 

6 ±(t)  = o((t  - ti)Pi)  . 

By  Proposition  (58) 

N - h ■ °<*q+1>  • 

Since  p^  > 0,  combining  the  above  establishes  the  Lemma.  □ 

Before  continuing  we  need  to  establish  an  important  connection 
between  the  reduced  costs  immediately  before  and  after  the  basis  change 
at  tp  ti  < T. 
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(67)  Lemma.  Let  p be  such  that  ((3^)  = i\.  Let  > 0 be 


such  that 


c(j)(t!)  = 0 , 
r.  i 
l 

_(n.)  + 

c (t. ) ^0  . 
r.  ' l ' 

l 


0 < j < n, 


Let  h^(s)  be  defined  by 


Vs>  - (%1  + ?K'e1>’1  (b*  + s V 


and  let  to . and  n.  be  such  that  the  leading  term  in  the  Laurent 

£ & „ 

A 

expansion  of  hg  (s)  is  05^s  . Then 

^ -i  5 *1  movf  O n _ n 1 


(i)  d^  =0  for  j > max{0,  n^  - n^) 


(ii)  if  n^  - n^  > 0,  d£ 


(iii)  if  n^  - ru  < 0, 


_n  -n  -l,i  n (H  ) 

1 1 = (-1)  1 05,  5 1 (t!) 

x ' Hr.  x l 
l 


(5  -n  ) (n.-n  ) n (n.) 

= / * ct?  ■ = £ 1 * (<>  - t-n  v,;  <<)  • 


1If  no  such  n.  exists  then  c (t)  is  identically  zero  to  the  right 

ri 


Set  n^  = 0. 


-''nT’V; 


of  ti. 


■v 


Proof. 


We  shall  use  Laplace  Transforms. 


A(t)  B p 


A(s)ds  K.p 


By  (ll),  A satisfies1 
T 

(t)  - / A(s)ds  K.p 


for  t £ (t^  ^,t^)  in  the  distributional  sense.  For  convenience  we 
shall  reverse  the  time  direction  and  shift  the  time  origin  to  t^. 
Define  u on  [0,<»)  by 


t 

(68)  n(t)  B + / u(s)ds  K.  = g (t),  t >0 

Pi  0 pi  pi 


where 


T 

g(t)  = c(t.  - t)  - / A(s) ds  K 
t. 

l 


and  we  assume  that  c(*)  has  an  analytic  extension  from  [0,°o)  to 
(-00,00).  Clearly 


p(ti  - t)  = A(t)  for  t •£  (ti_1>ti)  • 

Further,  it  is  easily  seen  by  definition  that  -d^1  is  the  coefficient 
of  5^  in  defined  by 

t 

(69)  g£(t)  = g^(t)  - p(t)b^  - / n ( s ) ds  k£  , 

and  that  c.(t)  is  the  functional  part  of  g (t.-t). 

& Jo  1 

^Strictly  speaking  we  should  write 

T T 

/ A(s)ds  as  / A*(s)ds  + S v 
tj,  t^  k>i 
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.*»  ' 


Taking  Laplace  transforms  of  (68)  yields 


M = gR  (s)  (B  + i K )_1 
Pi  Pi  s Pi 


and  substituting  into  (69)  yields 


A A /\  A 

ig(s)  = g^(s)  - gp> (s)  h^(s) 


We  now  wish  to  relate  g^(s)  to  the  reduced  costs  taken  with 
respect  to  the  basis  Denote  these  with  the  superscript 

Thus  we  define 


where 


5i(s)  = g^(s)  - g (s)  h*(s) 
pi+l 


h|(s)  = (B  + i K )_1  (b  + i k ) . 

1 Pi+1  S Pi+1  1 s Z 


We  shall  show  that 


(71)  g^(s)  = g*(s)  - g£  (s)  hip(s)  . 


Note  first  that  h.(s)  and  h*(s)  are  related  by 

i/  i/ 


Vs)  = h!j(s)  ' h?.j(s)  hlP(s)/hJ.p(s)'  J ^ p 


A /V 


Vs)  • hsP<s)/h;lP(s)  • 


This  can  be  easily  shown  by  relating  (B.0  + — K.  )-1  and 

&i  s 


(B.  + ^ K.  ) \ Rewrite  (70)  as 

Pi+1  s Pi+1 

5/s)  = g*(s)  + gp  ^(s)  h*(s)  - gp_(s)  h^(s) 

A A 

and  gp.(s)  h^(s)  as 

gPi(s)  h/s)  = ep.+i(s)  Vs)  + (gr_(s) -g^(s))  hip(s). 

Combining  these  two  yields 

(73)  g/s)  = g*(s)  + (g^  (s)  - gr  (s))  hip(s)  + gp_  (s)(h  *(s)  -h^s)). 
From  (72)  we  have  that 

hhis)  ■ Vs)  =h?.d(s)  Vs)'  j ^ 
h!p(s)  - Vs)  = (hJ.p(s)  - v Vs)  • 

Substituting  this  into  (73)  and  rearranging  yields  (71) . 

From  (71)  we  can  now  establish  the  lemma.  Observe  that 
g^C^  - t),  t < ti?  is  the  reduced  cost  c^(t),  t < t , had  there 
been  no  basis  change  at  t..  Hence  g*(t.  - t)  is  an  analytic 
function  and  we  obtain 

(-1)J  g|(j)(0+)  = cp}(t^)  . 
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Applying  Lemma  C.4(l9)  to  (71)  yields 


Us)  = Z (-1)J  ^><<)/s3+1  - ( Z <-!)•>  V-) 

j=0  0=0  i 


\i  Mt*+\  /J+1N 


Inspecting  this  expression  we  see  that  there  are  no  terms  in  sJ  for 

j > max{0,  n.  - n.).  Hence  d^1  = 0 for  such  j,  and  (i)  is  established. 

1 n -n  -1  n.+l  _(n.) 

If  n^  - ni  > 0,  the  coefficient  of  s is  (-1)  1 03^cr  1 (t^), 

establishing  (ii).  If  n - ii.  <0,  the  coefficient  of  s ^ 1 is 

Xj  X 

(-1)^  c|^(t^)  for  0 < j < ni  - ry  so  that  (t")  = c^(t^). 

-(n.-n^)-l 

Further,  the  coefficient  of  s is 


n.-n  (n.-n„)  , n.  (ii.) 

(-D  1 1 (t!)  - (-D  Vs  1 (<) 


l ri  ' V 


so  that 


5 „ 1 1 t.) 
i i 


_(n  -n  ) n _(n  ) 

C (t.  )-(-l)  ^ 05  C (t.  ) 

Si  ' i H r.  l 

l 


as  required  for  (iii).  □ 

(74)  Corollary. 

(i)  d^1  = 0 for  j > max{0,  q^^  - p - nj 


(ii)  if  q.±  - pt  - y > 0, 


VVV1’!  , ,.“1  1 -(Si>  +. 

d«.  ■ '-1'  V cr.  <‘l> 

i ^i  l 
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■ * -.v'  • 


(iii)  if  ” “i  5 0 > 


and 


S<J)(t-)  - o , 
1 


0 < j < n.  + p.  - q. 
— 1 1 x 


(n  +p  -q.  ) _ <L-P,+1  i + 

C 1 1 1 (t.)  = (-1)  11  V1  c 1 (t  ) 

Z . l ' q.  r.  v i ‘ 

l l 


where  is  given  in  Lemma  (47), 


Proof.  From  the  proof  of  Lemma  (47)  it  can  be  easily  shown  that 

n.  = q.  - p.  and  that  as  = v1  . Since  x„  is  basic  on  [t.,t..,), 
l ^ i i iL  q.^  £.  l l+l 

it  follows  by  complementary  slackness  that  c (t)  = 0,  t £ [t.,t  ). 

Xj  ^ 1 i rl 

Hence  cfPpt?")  = 0 for  j > 0.  Applying  the  lemma  yields  the 
i 

result.  □ \ 


We  now  continue  with  our  proof  that  h2(e)  = 0(e^+2). 
(75)  Lemma.  For  all  i such  that  t£  < t 

/ 1 (t)  Tj'(t)dt  - E af  ri!(j)(t.)  = 0(£q+2). 

t^  i 0 i 

Proof.  Since  t^  -*  t^,  and  since 

(iJ  (<l) 

^i  (ti)  ^ 0 as  e -» 0 , 

we  may  write,  by  (60), 

n[u,(ti)  = 0((t*  - tt)  ),  0 < j < q.  . 
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Since  t!^  - t^  = we  obtain 


Thus 


(d) 


TirU)(t.)  = o(e  1 ). 


l[VJ;(ti)  = 0(eq+2)  when  0 < j < ^ - 2 . 
By  Corollary  (7*0  (i),  since  p.  > 0, 


d^1  = 0 for  j > max{q.  - 1,  0}. 

h 1 


Thus 


saf  q(«(ti)  = o(^+2> 


J i 


We  next  show  that 
t . 

/ 1 5 (t)  q’ (t)dt  = 0(eq+2)  . 

t!  \ 1 

1 

If  q^  > 1 it  follows  since  t'  - t.  = 0(eq+'1')  and  since 

ri[(t)  = 0((t*.  - ti)  ).  Suppose  that  q.^  = 0.  Then  since  p > 0,  we 

have  by  Corollary  (7*0(iii)  that  c (t)  = 0(t!  - t.).  Since 

1 x 

t|  - t = 0(eq+  ),  we  are  done. 

This  completes  the  proof.  □ 


Combining  Lemmas  (66)  and  (75)  with  (65)  shows  that 
h2(e)  = 0(eq+2). 

We  now  state  our  main  result. 
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(76)  Theorem.  Decrease  in  the  objective 

Let  x'  be  constructed  as  in  Sections  4.2.1  and  4.2.2.  Then 
there  is  an  M < 0 such  that 


T T 

/ c(t)  x'(t)dt  - / c(t)  x(t)dt  = + 0(cq+2)  . 

0 0 


Proof.  This  is  immediate  from  the  above  and  Lemma  (64) . □ 


In  concluding  this  section  we  consider  the  case  t'  < T.  This 

K 

can  only  occur  if  some  (T)  = 0,  r^  £ p^»  and  x has  a feasible 
extension  on  a neighborhood  to  the  right  of  T.  Let  i and  q refer 

to  the  new  basic  variable  beyond  T,  in  keeping  with  and  q^  at 

the  other  breakpoints. 

The  reason  why  this  is  special  is  that  Lemma  (67)  does  not  apply 

at  the  endpoint  t,  = T.  In  the  event  that  q > 1,  this  means  that 

“ "i  k. 

we  may  have  d^  / 0 for  j = q - 1,  and  therefore  that  Lemma  (75) 

\ k 

is  no  longer  generally  true.  Following  the  proof  of  Lemma  (75),  we 
see  that  this  could  yield  a positive  contribution  of  0(€q+1)  to  the 

-V1,k 

objective  value  if  d < 0.  If  q = 0,  we  may  have  c (T)  > 0, 

■*k  K \ 

with  the  same  effect. 

-qk"1,k 

From  an  algorithmic  point  of  view,  when  d < 0 if  q > 1 

J /,  It  “ 

k 

or  c (T)  > 0 if  q = 0,  we  suggest  first  finding  an  alternative 
\ K 

extension  beyond  T for  which  these  do  not  hold.  Should  this  not  be 
possible,  simply  allow  t£  to  adjust.  If  a net  increase  has  been 
obtained  then  the  whole  iteration  must  be  disregarded  and  we  either 
select  some  other  interval  of  perturbation  or  use  the  technique  of 
Section  4.5. 
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Note  that  the  question  of  adjusting  t^  will  in  any  case  only 
occur  at  isolated  iterations. 

4.5.  Improving  the  Solution  by  Adjusting  a Breakpoint 

This  section  considers  the  adjustment  of  some  breakpoint  t^ 

to  obtain  an  improved  solution.  As  in  Section  4.2  we  begin  with  a given 

basic  solution  x and  its  associated  reduced  costs  c(-)  and  {dJ1}. 

The  only  case  we  consider  here  is  that  of  p.  = 0,  i.e.  x (tT)  > 0. 

i 

The  reason  for  this  is  that  adjusting  t.  when  x (t7)  = 0 (i.e. 

i 1 

p^  > 0)  usually  causes  the  occurrence  infeasibilities,  or  changes  on 
t > t^  that  have  greater  orders  of  magnitude  than  the  adjustment  at 


That  an  improvement  can  be  obtained  by  adjusting  t^  will  be 

signaled  either  by  one  of  c (tT)  < 0,  c (t.)  < 0,  or  by  some 

I* . 1 Jc>  • 1 

..  1 1 
d^1  ^ 0.  While  the  former  is  a special  case  of  that  considered  in 
i 

in  Section  4.2,  it  is  much  more  convenient  to  think  only  in  terns  of 
adjusting  a breakpoint. 

Throughout  this  section  i will  be  held  fixed,  and  0 < t.  < T. 

We  shall  let  t!  be  one  of  t.  - e,  t.  + e for  e > 0 and  small,  an 
i i i 

then  adjust  the  remaining  breakpoints  tr  > t^  in  the  same  way  a 
done  in  Section  4.2.2.  All  notation  will  be  as  in  the  previous 


^Cf.  Remark  after  Proposition  (58). 
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A 


5-79 


Proof. 

(i)  By  definition  cf  n^. 


Also 


t iCt)  = xr  (t“)  + cKt-t^  . 


Substituting  these  into  (78)  and  integrating  yields  (i). 

(ii)  By  Lemma  (47),  | ^ and  are  related  by 
qi  . , . t 

£ vfcV  '(t)  + / q).(t-s)  i)l(s)ds  = | (t) 

k=0  K ti 

for  some  (vfj  and  9 (•).  Since  |.(t')  >0  for  e 
k 1 i i 

sufficiently  small,  and  since  T)!^k^(tp  = 0 for  0 < k < q^, 


we  obtain 


i (<k) 

v ^ ^ (tp  - 6i(tp  . 


Expanding  |^(*)  about  t^  yields 
i (qi^ 

v tj»  (t')  = x (t  ) + 0(e)  . 
yi  i 

Expanding  t)'(*)  about  t^  yields 

q -k 

(k)  (O  (t-t!)  q.-k+l 

ii  <t)-n;  (tp  (q^t),  o((t-t-p  ) 

for  0 < k < q^  . 


Hence 


(82)  v*  T,'(k)(tl)  = 


- € 1 Vk+1 

Xri(ti)  Xq^kT!+0(e  > 


for  0 < k < q^ 


r.u.y  t- 


- *\c>*  • 


Next 


dj1  In  (77)  is 


, apply  Corollary  (74).  If  q.±  - n±  > 0,  then  the  leading 


jVV1'1  i -(5i)  +. 

% - (-1)  \ \ <V  • 


From  (82)  we  then  have 


_q.-n.-l, i (q  -n,-l) 

(S3)  -d^1  i,J  (<) 

°i+1  + 

= (-1)  c (t  ) x (t  ) 

ri  1 ri  1 


e 


n +1  - 

i n,  +2 


(V1)! 


+ 0(6  1 ) 


n.+2 


Also  by  (82)  it  is  clear  that  all  other  terms  in  (77)  are  0(e  1 ). 

Hence  g^e)  is  given  by  the  right  hand  side  of  (83). 

If  - ni  < 0 then  all  the  d^  = 0,  and  we  may  write, 

1 

by  Corollary  (74), 


C,  (t)  = (-1) 


V1  1 -<si> 


v«i 


t \utJ  + / n -q.+l\ 

'1%  OtV 1 1 ) 

1 i (n±-q) 


for  t!  < t < t, . Multiply  this  expression  for  c (t)  by  that  given 
i ” ^ 

for  q^(t)  in  (8l)  with  k = 0.  This  yields 


Cg  (t)  T^(t) 


, A+1  i ^i)  -(5i)'  + (t-t))<li  (t.-t)^  ^ n.+l 

= (-1)  v1  n*  (tn  c 1 (t!) ± i + o(e  1 

qi  1 1 ri  1 q,:  (51  - q,)! 


for  tj  £ t £ tj. 
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I 


r 


evaluate  / c (t)  r)!(t)dt  observe  that  since  t ! = t -€, 
t!  i 1 i i 


the  value  of 


% n -q 

/ (t-t>)  (t.-t)  1 Ldt  . 

*i 


is  proportional  to  a beta- function,  and  indeed  evaluates  to 


qi:  Si+1 

(S1  + 1): 


This  then  yields 


/ 1 (t)  TiJ(t)dt 

t£  l 1 

ii.+l  , (q, ) (n  ) ni+1  5 +2 

= (-D  \ ^ (tp  c 1 (t  )-§ + 0(e  1 ) . 

qi  1 1 ri  1 (5i+i): 

Using  (80) , we  obtain  that  g^e)  is  equal  to  the  right-hand  side 
of  (83). 

We  have  thus  shown  that  g^e)  is  given  by  the  right-hand  side 
of  (83)  for  all  q^  n^  By  (i),  (ii)  now  follows  immediately,  and 
the  proof  is  complete.  □ 


This  lemma  tells  us  precisely  what  to  expect  locally  when 

adjusting  the  breakpoint,  and  only  requires  information  about  the 

behavior  of  c at  t = t . . We  summarize  our  conclusions  in  Table  2 
ri  1 

below. 
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Table  2.  The  Local  Contribution  to  the  Objective  by  Moving  t^ 
to  t^  - e and  t^  + e.1 


_(n  ) + 

c (tT)  > 0 

r^  ' i 

— m 

.(*»«)  + 
c (t  ) < 0 

ri  1 

even 

n1  odd 

even 

n^  odd 

t-  = ti  - 6 

- 

+ 

+ 

- 

tl  = ti  + e 

+ 

+ 

- 



n. +1 

The  *+’  and  denote  increases  and  decreases  of  0(e  ) 

respectively,  and  were  obtained  from  Lemma  (79)  by  noting  that 

x (O  > 0. 

i 

The  case  that  we  shall  be  most  interested  in  is  that  of  = 0, 

i.e.  c (t*)  / 0.  This  is  because,  as  we  shall  next  indicate,  it  is 
ri  1 

the  only  case  in  which  a decrease  obtained  by  the  above  adjustment 
always  dominates  any  possible  increases  due  to  the  adjustment  of  the 
time  points  ty  > t^.  However,  if,  for  example,  no  other  breakpoints 
need  be  adjusted,  then  shifting  t^  according  to  Table  2 will  yield 
a decrease  for  n^  > 0. 

It  is  interesting  to  note  the  analogue  of  Lemma  (79)  (with 
= 0)  in  the  simplex  method;  if  a variable  enters  a basis  3 

^Recall  that  n,  is  the  least  integer  such  that  c^ni^(t^)  ^ 0. 

1 T . 1 
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replacing  xr  and  forming  a new  basis  g',  then  the  reduced  cost  of 

x with  respect  to  p'  is  opposite  in  sign  to  the  reduced  cost  of 
r 

x.  with  respect  to  p. 

« 

To  complete  the  procedure,  it  remains  for  us  to  follow  the  same 
course  as  in  Section  4.2.2,  and  determine  whether  or  not  the  breakpoints 

tr  > t^  can  be  adjusted  so  as  to  accommodate  the  change  at  t^  and 

yield  an  overall  decrease.  It  should  also  be  checked  that  the  adjust- 
ment at  t^  has  caused  no  immediate  infeasibilities,  i.e.  some  other 
variables  becoming  negative  near  t^. 

This  can  all  be  done  by  directly  applying  the  tools  and  results 
of  Section  4.2,  and  the  details  will  be  omitted.  Proposition  (58) 
applies  as  it  stands  with  q = Oj  that  is,  the  shifts  in  the  break- 
points, t£  - tr>  are  0(e)  and  (w^  - wy)^(t)  = 0(e),  r > i.  When 

n^  = 0 and  we  have  determined  tj  by  Table  2 so  as  to  yield  a local 
decrease,  Theorem  (76)  too  applies,  with  q = 0,  to  yield  an  overall 
decrease  of  0(e). 


4.4.  Improving  the  Solution  When  Some  d^  ^ 0 

Jo 

From  section  4.1  we  see  that  the  {5^*)  are  dual  infeasible 
if  for  any  i,  l,  and  j > 1 


or  d^1  / 0 . 1 


1From  now  on  it  will  be  understood  that  "d'j1  / 0"  means  d^1-  > 0 
when  j = 0. 
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In  such  a case,  even  though  we  may  have  c(t)  > 0,  t € [0,T],  it  may 
still  he  possible  to  improve  the  solution  by  making  basic,  in  some 
suitable  way,  the  variable  x^(t”).  One  such  case  was  considered  in 
Section  4.3  where  with  i = i it  sometimes  paid  off  to  move  the 
breakpoint  t,.  In  general,  however,  a construction  along  the  lines  of 
that  given  in  Section  4.2.1  is  required.  Thus,  if  d^  / 0,  we  wish 
to  increase  = 9 on  a neighborhood  of  t^  in  such  a way  that 

0^^(tjL)  >0  if  3]1  > 0 , 

(84) 

0^  (t?)  <0  if  df<0. 

1 X 


This  is  because  -d^  appears  as  the  coefficient  of  x^(t  ) in  (3). 

X*  Xj  l. 

It  is  important  to  note  that  if  d^  ^ 0,  we  are  able  to  make 

X 

0(1)  changes  in  x ^ only  for  some  q > j.  This  follows 
from  Lemma  (67).  From  this  we  also  see  that  if  we  are  to 
make  x^  = 9 basic  over  [t^  - e^,  t^  + e^),  + e2  = e> 

dominating  local  contribution  to  the  objective  will  be  given  by 


where  j*  is  the  largest  j such  that 

df  /0. 

Thus  we  shall  pick  our  perturbation  0 such  that  (84)  holds  with 
j = j*.  See  Figure  8 below  for  an  illustration  of  the  case  j*  = 1. 
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FIGURE  9 


I H* 


We  remark  that  when  d' 


a. 


he  obtained  by  just  shifting  t^ 

by  Corollary  (7M  and  v*  > 0, 

qi 


> 0,  a local  improvement  can  always 
This  follows  from  Table  2,  since 


-i*  i / ni  -^ni^  + 

4:  >»*(-»  v (*i 


) > o . 


The  unfortunate  part  about  in  general  attempting  to  improve 
the  solution  when  some  d^1  / 0,  is  that  even  though  we  may  be  able 
to  obtain  a local  decrease  in  the  objective  value,  adjusting  the  break- 
points to  the  right  of  this  change  may  sometimes  yield  an  overall 
increase  in  the  objective  value.  This  was  seen  to  be  the  case  in 
Section  U.3  with  n^  > 0,  or  when  we  needed  to  adjust  t = T. 

Further,  when  p^  > 0,  we  may  not  even  be  able  to  obtain  a local 
improvement  in  a neighborhood  of  t^.  However,  these  are  precisely 
the  situations  in  which  the  complementary  slackness  conditions  yield 
alternative  dual  variables  A*(’)  and  {v^)  and  which  will  be 
considered  in  the  next  section. 
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One  particular  case  where  an  improvement  in  the  objective  value 


i 


is  always  possible,  is  at  the  endpoint 
construction  is  as  follows: 


t,  = T when  x (T)  > 0. 
k ek 


The 


(1)  5f'k>0. 

Carry  out  the  first  step  of  the  construction  of  Q in  Section 
4.2.1  on  the  interval  [T  _ €j  T],  for  e > 0 and  small.  Thus  simply 
malce  the  first  basis  change  from  fS  to  over  the  whole  of  this 

interval.  This  will  yield 

e(j*>(T)  = Meq'j*  + 0(eq-^+1) 


some  M > 0,  and  also 

0(j)(T)  = 0(eq'J)  , 0 < j < j*, 


where  q is  determined  as  in  (17).  The  contribution  to  the  objective 

value  then  becomes,  using  (63), 

T j* 

/ C (t)  e(t)dt  - Z dJ‘k  0(j)(T) 

T-e  1 j=0 

= -df’k  M£q-jV(q  - J*)!  + 0(cq"^+1) 

While  Lemma  (67)  does  not  apply  to  the  endpoint  t^  = T,  it  can  still 
be  easily  shown,  by  using  (70)  in  the  proof  of  Lemma  (67),  that  q > j*. 
From  this  it  therefore  follows  that  a strict  decrease  is  obtained  for  e 
sufficiently  small. 
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' .<• 


< 0. 


(ii)  dj*’k 

Partition  [T  - e,T]  into  several  subintervals  over  which  we 
alternate  the  sign  of  0^q\  as  in  Section  4.2.1,  in  such  a way  that 
gVj  indeed,  by  the  results  of  Section  4.2.1,  we  can  choose 

the  intervals  so  that 

g(  j*)  (t)  = + 0(eq-^+1) 

for  some  M > 0,  and  also 

e(j)(T)  = 0(eq"J),  0 < j < j*. 

As  before,  this  yields  a strict  decrease  for  e sufficiently  small. 

When  j*  = 1,  the  0 so  constructed  will  be  as  in  Figure  8 
except  with  the  solution  truncated  beyond  t^. 

At  points  t^,  0 < t^  < T,  we  remark  only  that  a similar 
construction  applies  with  the  exception  that  we  need  to  continue 
beyond  t^  to  meet  requirements  of  the  from  (20).  In  the  event  that 
i = iL,  these  requirements  need  not  be  met,  as  illustrated  in 
Figure  8. 


4.5.  Ambiguity  in  the  Dual  Variables 

In  the  previous  three  sections  we  considered  improving  our 
given  basic  feasible  solution  when  the  dual  variables,  defined  by 
complementary  slackness  as  in  (9),  were  infeasible.  However,  as  was 
noted  there,  obtaining  an  improved  solution  was  not  always  possible, 
for  the  following  reasons: 
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(i)  Making  a change  of  basis  over  an  interval  of  length  e may 
force  us  to  adjust  other  breakpoints  to  the  right  of  this 
interval,  and  these  may  violate  the  condition  of  Proposition  (58) , 
namely  pj^  = p^. 

(ii)  Should  this  change  of  basis  force  tfc  = T to  move  left,  it  has 
to  be  disregarded  if  there  is  either  no  feasible  extension  of 
x on  a neighborhood  to  the  right  of  T,  or  if  the  result 
of  allowing  t'  to  adjust  yields  an  overall  increase  in  the 

Iv 

objective  value. 

(iii)  As  was  noted  in  Section  4. 3 and  4.4,  making  a change  of  basis 

on  a neighborhood  of  some  t.  when  some  d^1  / 0,  was  in  general 
not  possible  when  p^  > 0.  Even  when  we  had  p^  = 0,  there  were 
seen  to  be  cases  where  the  need  to  adjust  later  time  points  led 
to  an  overall  increase  in  the  objective  value. 

What  these  cases  all  have  in  common  is  that  they  are  being 

'blocked'  by  some  ^ having  p^  > 0,  i.e.  the  variable  xr  (t^) 

i 

leaving  the  basis  is  zero.  This  means  that  the  complementary 

slackness  condition 

/ -v  -0i  1 

Xy.  (t  ) > 0 d = 0 , 

ri  1 i 

does  not  apply,  and  therefore  that  the  dual  variables  are  no  longer 
uniquely  defined. 

For  such  a t.,  let  us  now  relax  the  restriction  d°i  =0  by 
i 

setting  d0i  = y (y  some  scalar)  and  determine  the  dependence  of  the 
^See  Section  4.1, 
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dual  variables  and  reduced  costs  on  y.  Denote  these  by  A*(t|y), 
(v^r(r)},  c(t|y)  and  (d^r(y)},  where  i here  is  fixed.  In  this 
analysis  we  shall  maintain  as  before  all  the  remaining  relations  in 
(9)  defining  A*  and  { v^r) . Let  p be  such  that  ^ . 

(85)  Lemma.  Let  7 r*(-)  and  {MJ'r}  satisfy 


(i)  tt  (t)  = 0 , 

Jr  - n 

M = 0 , 

*■  ■ ^ *'  • Oi. 


t S [t.,T] 


r > i,  j > 0 


(ii)  tt  (t)B.p  + / v (s)ds  K.p  + u K.p  =0,  t€[ti_l’V 


Oi  li  _ T 

„ B.6i  + „ K.^  - ep 


M 


J ^"p  + <]'+l>  iv-  _ n 

B-r  + M k.r  - 0 , 


J > 1 


1 *■  • i Ov 


(iii)  7 r (t)B  + / tt  (s)ds  K.  + Z ^ . 1 (t  - t)  = 0, 

P-  V 


r v=r 


t € [t  +.  ) 
r-J.  r ' 

MJrB.  + uJ+1,rK..  =0,  j > 0,  r = 1, . , . , i-1. 

pr  Pr 


Then 


A (t|y)  = X*(t|0)  + yfr*(t) 
v,]r(r)  = vjr(0)  + r^T 


(86) 


*.  • -T  . A Ov 


c(t|r)  = c(t|o)  - r{77  (t)B  + / 77  (s)ds  K + Z n°VK  1 (t 

t v=l  v 

aJr(r)  = 3Jr(o)  + r(uJrB  + MJ+1,rK)  . 


-t)) 
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and  (vJi(r))  by 


Proof.  This  follows  by  noting  that  A (‘If) 
definition  satisfy  (9)  with  the  relation 

v°LB.0  + vUK.p  = 0 
Pi  Pi 


replaced  by 

v0i(r)B.^+  vU(r)K.p_  = re^  . □ 


The  relations  in  (86)  show  that  the  dependence  of  A (*|y),... 

*/ 

on  y takes  on  exceedingly  simple  form.  Note  that  it  ( • ) and 
i r 

} are  uniquely  determined  and  are  independent  of  y.  Note  also 

that  by  construction,  A (• |y)  and  (vjr(y))  satisfy  the  complementary 

slackness  conditions  for  any  y.  In  particular,  if  for  some  y, 

the  triple  (A  ( * | y ) , {v^r(y) ),  { tr } ) is  dual  feasible,  then  our 

current  basic  feasible  solution  is  optimal. 

_0i 

Using  this  device  of  setting  d = y and  parametrizing  the 

i 

dual  variables  in  terms  of  y is  an  essential  ingredient  of  any 

continuous  time  simplex  method.  This  is  amply  demonstrated  by 

Example  E(2),  where  the  constraints  are  such  that  there  is  only  one 

feasible  solution— hence  trivially  optimal — and  yet  the  dual  variables 

defined  by  (9)  in  the  usual  way  are  dual  infeasible.  However, 

parametrizing  in  terms  of  y does  yield  (^*(*|y),  { v'^r(y)}, 

{ t^. ) ) dual  feasible  for  y < -1. 

Both  Lehman  [18]  and  Drews  et  al.  [12]  recognized  the  need 

for  this,  and  suggested  extending  the  concept  of  a basis.  We  now 

interpret  their  approach  within  our  framework. 
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Suppose  that  we  are  attempting  to  find  an  improved  solution 

using  the  techniques  of  Sections  4.2,  4.3  and  4.4,  but  that  we  get 

"stuck"  in  one  of  the  situations  mentioned  at  the  beginning  of  this 

section.  For  example,  suppose  that  at  some  t , c^(t  ) < 0,  and  that 

* * 

in  trying  to  increase  on  [t  , t +e),  we  find  that  for  all  e > 0 


we  are  forced  to  move  a breakpoint  t^  that  fails  to  satisfy  p^  = p^. 

We  then  parametrize  the  dual  variables  in  terms  of  y by  relaxing 

_0i 

the  restriction  d =0,  and  thereafter  fix  the  value  of  y by 
ri 

imposing  the  restriction 

(87)  c^t*^)  = 0 . 

This  corresponds  in  discrete  time  to  making  basic  at  the  point 

* 

t , and  dropping  x from  the  basis  at  the  point  t, , with  no  change 
ri 

in  the  objective  value.  In  the  simplex  method  it  is  therefore  a 

degenerate  pivot  whose  only  effect  is  to  determine  an  alternative 

complementary  set  of  dual  variables.  The  same  can  be  done  in  continuous 

time  by  redefining  which  variables  are  basic  at  certain  isolated  points. 

In  [12]  this  is  termed  as  a "point  pair." 

Since  our  boundedness  assumption  does  not  allow  any  6's  in 

the  primal  problem,  assigning  basic  and  non-basic  labels  to  variables 

at  isolated  points  can  have  no  effect  on  the  primal  solution.  However 

this  does  have  a marked  and  important  effect  on  the  dual  variables. 

* 

Observe  that  making  x.  basic  at  t and  dropping  x at  the 

K,  ri 
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point  t^  gives  a count  of  m+1  variables  basic  at  t = t*  and  m-1 
basic  at  t = t^.^ 

* 

The  next  step  is  now  to  move  t either  left  or  right, 
depending  on  whether  or  not  c^(t*)  is  >0  or  ■ 0,  while  simultaneously 
adjusting  y so  as  to  maintain  (87).  The  aim  of  doing  so  is  to  try 
to  obtain  either  dual  feasibility,  and  hence  a proof  of  optimality, 
or  a new  interval  over  which  we  can  attempt  another  basis  change. 

Whether  or  not  a procedure  of  this  nature  will  always  yield 
either  a proof  of  optimality,  or  an  eventual  strict  decrease  in  the 
objective  value  is  not  known.  It  is  also  still  an  open  question 
whether  or  not  use  of  this  "point  pair"  technique  is  needed  only  to 
obtain  a proof  of  optimality.  Posed  another  way,  can  we  have  a solution 
that  is  not  optimal,  but  has  the  property  that  at  no  point  where  the 
dual  variables,  defined  in  the  usual  way  by  (9),  are  infeasible 
can  we  make  a change  of  basis  using  the  techniques  of  the  previous 
sections  and  obtain  a decrease  in  the  objective  value. 


^"By  Theorem  2(15)  we  know  that  every  right  analytic  extreme  point  has 
at  most  m variables  positive  over  any  interval.  Hence  having  a 
pair  of  intervals  with  this  surplus -shortage  property  is  not  possible. 
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CHAPTER  5 


CONCLUSION 

This  dissertation  has  been  aimed  at  establishing  some  of  the 
fundamental  concepts  and  results  required  to  develop  a continuous  time 
simplex  method  (for  the  case  of  constant  coefficients  in  the  constraints). 

To  this  end,  we  have  accomplished  the  following: 

(i)  A characterization  of  all  so-called  right  analytic  extreme 
points  that  allows  us  to  work  with  them  as  we  would  with  the 
basic  feasible  solutions  in  linear  programming. 

(ii)  A distribution  free  statement  of  the  optimality  conditions. 

(iii)  A means  of  moving  from  one  extreme  point  to  another,  with  an  \ 

improvement  in  the  objective  value. 

Much  remains  unanswered  from  both  the  algorithmic  and  purely 
mathematical  points  of  view. 

On  the  algorithmic  side  there  is  the  need  to  combine  the  con- 
structions of  Chapter  4 with  an  effective  heuristic  that  chooses  the 
intervals  over  which  to  attempt  a basis  change,  and  also  decides  on 
how  large  to  make  e before  recomputing  the  prices.  Developing  such 
a heuristic  should  go  hand-in-hand  with  the  design  of  a computer 
implementation.  Based  on  small  examples  solved  by  hand,  it  would 
seem  that  an  initial  implementation  should  be  highly  interactive  in 
nature.  Of  great  importance  in  any  implementation  will  be  the  question 
of  how  to  solve,  numerically,  equations  of  the  form 
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t 

Dy(t)  + L / y(s)ds  = d(t) 

0 

where  the  only  condition  on  D and  L is  that  (^D  + L)-1  exists 
for  some  scalar  ji.  1 

More  fundamental  is  the  question  of  convergence,  about  which 
nothing  has  been  said.  This  is  related  to  the  (unanswered)  question 
of  when  is  the  optimum  attained  at  a piecewise  analytic  solution 
having  only  finitely  many  breakpoints.  It  is  also  intimately  connected 
with  the  question  of  whether  the  need  to  take  advantage  of  ambiguities 
in  the  dual  prices,  alluded  to  in  Section  4.5,  occurs  only  (if  at  all) 
to  obtain  a proof  of  optimality,  or  whether  it  can  occur  before  the 
optimal  value  is  reached. 

The  last  major  open  question  is  whether  or  not  the  sufficient 
conditions  for  optimality,  presented  in  Section  4.1,  are  also  necessary 
conditions  when  the  optimum  is  attained  at  a piecewise  analytic  solution 
having  finitely  many  breakpoints. 
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APPENDIX  A 


RIGHT  ANALYTIC  FUNCTIONS 

This  appendix  contains  the  propositions  about  right  analytic  func- 
tions that  are  required  in  Section  2.  The  main  result  is  the  proof  of 
Lemma  2.2(7)  which  we  restate  below  as  Lemma  (3).  For  completeness  we  review 
the  definitions  of  analytic  and  right  analytic  functions. 

(1)  Definition:  [22].  Let  OCR  be  open  and  g;ft  -> R.  Then  i 

g is  said  to  be  analytic  on  fi  if  to  every  open  interval  I CH  with 

i 

center  a,  there  corresponds  a series  c^t-a)1  which  converges  to 

g(t)  for  all  t £ I. 

(2)  Definition  A function  gj[0,T]  ->  R will  be  called  right  analytic 
if  for  each  t 6 [0,T)  there  is  an  e > 0 and  an  analytic  function 
h:(t-c,  t+c)  -* R such  that  g(s)  = h(s)  for  all  s £ [t,t+e). 

(3)  Lemma ■ Let  g:[0,T]  -»  Rn  have  right  analytic  components  g.^-), 

i = 1, ...,n.  Then  there  exists  a (possibly  infinite)  disjoint  family 

of  open  intervals,  (I.)  such  that  [0,T)  = U.  I , and  such  that  for 

J 0 

each  interval  Ij,  each  g^  satisfies  j 

(i)  g^  is  analytic  on  1^,  and 
(li)  either  | g^|  >0  on  1^  or  g1  = 0 on  1^. 

^If  I = (t',tM)  is  an  open  interval  then  I denotes  the  interval 
ft',t"). 
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The  proof  of  this  result  will  require  the  following  lemmas. 


J - 
* %. 


(4)  Lemma.  Let  ICR  be  an  open  interval  and  {J^}  a family  of 

open  intervals  whose  union  is  I.  Let  g: I -> R be  given.  If  g is 
analytic  on  each  then  g is  analytic  on  I. 

(5)  Lemma.  Let  K C R be  a compact  interval,  and  let  h be  an  analytic 
function  defined  on  a neighborhood  of  K.  Then  either  h = 0 on  K 

or  h has  finitely  many  zeros  in  K. 

(6)  Corollary.  Let  ICR  be  an  open  interval,  and  let  h;  I -»  R 
be  analytic.  Let  Z(h)  = (t  £ I;h(t)  = 0).  Then  either  Z(h)  = I, 
or  Z(h)  has  no  limit  point  in  I.  In  the  latter  case  Z(h)  is  at 
most  countable. 

The  proofs  of  both  these  lemmas  may  be  found  in  [22]. 


(7)  Lemma.  Let  { J^)  he  any  family  of  open  intervals,  and  {1^}  be 
a disjoint  family  of  open  intervals  such  that  1^  = UQ  J^.  Then 


U J C II 
a a - j 


Proof.  Since  the  collection  [Ij]  is  disjoint,  the  connectedness  of 
intervals  implies  that  each  JQ  is  contained  in  a unique  Ij.  Thus 
Ja  C Ij  for  some  J and  we  are  done.  □ 


113 


1 

-*  - •“•cy 


Proof  of  Proposition  (5).  We  shall  prove  the  proposition  for  the  case 
n = 1,  since  from  this,  the  general  case  follows  immediately. 

By  definition  of  g being  right  analytic,  for  each  t € [0,T) 

there  exists  an  > 0 such  that  g is  analytic  on  K^.  = (t,  t+e^)  C [0,T). 

Let  V = U^_  ^ and  W = U K^.  Since  V is  open,  there  exists  a disjoint 

00 

family  of  open  intervals  whose  union  is  V. 

On  each  we  now  obtain  the  following:  By  Lemma  (J+)  g is 

analytic  on  J^.  By  corollary  (6)  either  g = 0 on  or  g = 0 on 

at  most  a countable  sequence  {t^,t2,...)  c J^.  In  the  latter  case  write 
= (t',t").  Since  the  sequence  has  no  limit  point  in  J^,  and  since 
by  definition  g agrees  with  an  analytic  function  defined  on  a neighbor- 
hood  of  t’,  the  only  possible  limit  point  of  {t^}  is  t".  Hence  we 
may  assume  that 

t'  < t^  < t2  < • • • < t"  . 

With  this  partition  of  we  can  now  conclude  that  there  is  a (possibly 

infinite)  collection  of  disjoint  open  intervals  {L. ) such  that 
|g|  > 0 on  L^  or  g = 0 on  L^,  and  such  that  = U L*'. 

Now  it  is  clear  that  W = [0,T).  By  Lemma  (7),  it  follows  that 
^i  ^i  = Hence  ^ L^  = [0,T) . By  relabeling  the  fami  y 

(L^)  as  (1^),  we  obtain  the  desired  result.  □ 

(7)  Lemma.  Let  g,  h:[0,T]  -> R be  right  analytic.  Let  tQ  £ [0,T)  be 
such  that  g(tQ)  / h(tQ).  Then  there  exist  0 < r < s < T such  that 
g = h on  [0, r)  and  g / h on  (r, s). 

llU 


Proof.  Let  E = {t;g(t)  ^ h(t)}.  Since  tQ  £ E,  E is  nonempty.  Let 
r = inf  E.  Then  by  definition  of  r,  g h on  [0,r).  Since  both 

g and  h are  right  analytic  there  is  an  e > 0 such  that  g and  h agree 
with  analytic  functions  on  [r, r+e].  Again  by  definition  of  r,  there 
is  a t € (r,  r+e)  such  that  g(t)  / h(t).  By  Lemma  5,  g - h 
has  finitely  many  zeros  on  [ r,  r + e].  Hence  there  is  an  s £ (r,  r + e) 
such  that  g ^ h on  ( r, s ) . □ 
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EXAMPLES  OF  EXTREME  POINTS 


The  constraints  are  all  of  the  form 

t 

B(t)  x(t)  + j K(t,s)  x(s)ds  = b(t) 
0 


x(t)  > 0 


for  a. e.  t £ [0, T] . B(t) 


and  K(t,  s)  £ Rm  X n. 


(1) : An  extreme  point  that  is  not  locally  uniquely  defined,  and  is 
independent  of  small  changes  in  T. 


i 


'i 


m = 1,  n = 3,  T = 5,  B(t)  =0  on  [0,5].  Define  K(t,s)  = 

[k,  (t,  s)  k (t, s)  k, (t, s) ] on  the  triangle  0 < s < t < 5 as  follows: 


Set 


SjU)  = x2(t)  = x (t)  =1  on  [0,1) 

xL(t)  = x2(t)  = x,(t)  =0  on  [1,5] 


and  then  define  b(*)  by 
t 

b(t)  = f K(t,s)  x(s)ds,  t G [0,5] 

0 


Consider  now  the  specific  values  of  K(t, s): 

(i)  For  t £ [0,1)  the  equation  is  trivial,  i.e.,  0 = 0 so  that 
the  values  of  x on  [0,1)  are  not  in  any  way  determined  by 
the  coefficients  on  [0,1). 

(ii)  For  t 6 [1,2)  the  equation  reads 

1 . 

/ e s x..  (s)ds  = b(t)  . 

0 

The  left-hand  side  is  the  Laplace  transform  of  x.,  on  the  interval 
[0,1),  evaluated  at  t.  By  the  uniqueness  theorem  for  Laplace 
transforms  (see  Theorem  C.12)  the  above  equation  has  a unique 
(a.e.)  bounded  solution  x^  on  [0,1).  By  construction  this 
solution  is  x^  = x^  = 1. 

(iii)  For  t € [2,3)  and  t £ [3,  U)  we  obtain  similar  equations  in 
x2  and  x^  respectively,  and  conclude  that  the  only  possible 
solution  is  x2  = x2  and  x^  = x^  on  [0,1). 
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The  above  shows  that  x on  [0,1)  is  uniquely  determined, 
independent  of  the  choice  of  x on  [1,5].  Thus  choosing  x = 0 
on  [1,5]  yields  x = x on  the  whole  interval,  and  this  must  be  an 
extreme  point  solution. 

Notice  that  we  have  one  equality  constraint  in  three  nonnegative 
variables  and  that  on  [0,1)  all  three  are  positive  while  on  [1,5] 
all  three  are  zero.  □ 

(2) : An  extreme  point  that  is  locally  uniquely  defined  except  at  the 
origin,  and  whose  values  throughout  the  interval  depend  explicitly 
on  T. 

m = n = 1,  B(t)  = t,  b(t)  = -t  and  K(t, s)  = -2.  Thus 

t 

tx(t)  - 2 / x(s)ds  = -t 
0 

x ( t ) >0,  t € [0, T] . 

One  can  easily  show  that  the  only  solutions  to  this  equation  are  of 
the  form  x(t)  = 1 + at  for  OC  an  arbitrary  scalar.  a is  the 

derivative  of  x at  0 which,  once  determined,  yields  a unique 
solution  over  the  remainder  of  the  interval.  To  obtain  an  extreme 
point  solution  we  choose  the  least  a such  that  1 + at  > 0 on  [0, T], 
Thus  the  only  extreme  point  is  x(t)  = 1 - t/T. 


"■  r i i | ~ Pi  I * I • ‘ 
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Note  that  if  we  set  a = 0 (say)  and  choose  the  partition 

(I.)  with  I.  = (l/j+1,  l/j),  then  x(t)  = 1,  t € [0,T],  is  uniquely 
J J 

determined  on  each  interval  [l/j+1,  l/j).  However,  it  is  not  uniquely 
determined  on  [0,T],  and  is  also  not  an  extreme  point.  □ 
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APPENDIX  C 


L 


y 


DISTRIBUTIONS,  LAPLACE  TRANSFORMS  AND  EQUATIONS  OF  THE  FORM  Dx  + L / x = g. 

This  appendix  summarizes  the  techniques  and  results  required 
to  study  the  solutions  to  equations  of  the  form 

t 

(1)  Dx(t)  + L / x(s)ds  = g(t)  , t > 0 

0 

where  D,  LG  RmXm  and  g(>)  is  analytic.  We  remark  that  by  a 
suitable  change  of  variable  and  time  direction,  all  the  results 
pertaining  to  (l)  will  be  equally  valid  for  equations  of  the  form 

T 

(2)  X(t)D  + / Ms)ds  L = g(t)  , t < T . 

t 

The  material  presented  here  is  well  known,  and  we  refer  the 
reader  to  [27],  [11]  and  [ 5 ] for  further  details. 

When  D exists,  (l)  becomes  the  much  studied  Volterra  equation 
of  the  second  kind.  This  can  be  shown  to  have  the  unique  solution 

(3)  x(t)  = D_1g(t)  - D-1L  f e"D  D-1g(s)ds  ,1 

0 

However,  when  D is  singular  (1)  may  not  have  a solution  in  any  Revalued 
space  of  functions.  Consider  the  following  example. 

■ ■ — oo  n n 

^If  G G RmXm  then  eGt  denotes  the  matrix  exponential  E G y- 

n=0 
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(k)  Example. 


x.  (t)  - f x (s)ds  0 
0 

xL(t)  = 1 


Clearly,  if  x,  is  restricted  to  be  a real  valued  integrable  function, 
then  these  equations  have  no  solution.  However  we  can  meaningfully 
work  within  a larger  class  of  'functions,'  the  space  of  distributions, 
where  these  equations  do  have  a unique  solution. 

C. 1.  Distributions 

Let  D denote  the  space  of  all  C°°  functions  cp ; R ->  R that 
vanish  outside  some  bounded  interval  (depending  on  cp).  Then  the  space 

of  distributions,  f)',  is  defined  to  be  the  set  of  all.  continuous  linear 

1 

functionals  on  d.  Thus,  if  h £ o',  then  to  every  9 € £>,  h assigns 
a real  number,  to  be  denoted  by  (h/p  ).  Locally  integrable  functions 
may  be  considered  a subspace  of  D':  if  f;R  -*R  satisfies 

M 

/ | f (t)  | dt  < °0 

-M 

for  all  M,  then  the  functional  defined  by 

^See  [27]  for  the  notion  of  a sequence  (cpn)  converging  in  D. 
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<f,cp)  = / f(t)  cp(t)dt 

-00 

is  a continuous  linear  functional  on  £,  and  hence  is  a member  of 

D' . Distributions  generated  this  way  are  called  regular  distributions. 

Of  particular  interest  here  will  be  the  6 functional  and  its 
derivatives.  5 is  a distribution  defined  as  follows; 

<6,9)  = cp(0)  , 9 - 0 • 


Thus  if  we  think  of  formally  integrating  8 against  9, 

00 

l 5(t)  cp(t)dt,  then  8 may  be  thought  of  as  a 'function'  that  is  zero 

-00  00 

everywhere  except  at  the  origin,  where  it  is  so  large  that  / &(t)dt  = 1. 

—00 

Indeed,  distributions  were  invented  precisely  to  make  formal  notions 
of  this  kind  rigorous. 

Distributions  allow  us  to  differentiate  functions  that  do  not 
have  derivatives  in  the  usual  sense.  The  motivation  for  the  definition 
of  distributional  derivatives  is  the  following.  Let  f;R  ->  R be  con- 
tinuously differentiable.  By  definition 

(f(l^,9>  = / ff^(t)  <p(t)dt 
-00 


Integrating  by  parts  and  noting  that  9 vanishes  outside  a bounded 
interval,  yields 


<f(l\rp>  = - / f(t)  9^(t)dt 
= -<f,  9(1)> 
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Since  cp^  is  also  in  D,  the  right  hand  side  makes  sense  when  regarding 
f as  a regular  distribution.  Thus  for  rp  6 o,  h £ 8',  define  h^ 
hy 

<h(l),  cp>  = - <h,  p(l))  , 

and  more  generally 

(h(l),cp)  = (-1)1  (h,cp(1^)  . 

In  the  case  of  6,  we  get 

(5)  cp>  = (-1)1  <p(l)(0)  . 


With  the  above  notion  of  the  6,  we  can  now  solve  Example  (4) 
by  inspection. 


xL(t)  = l+(t) 
x2(t)  = 8(t) 


What  we  mean  by  writing  the  solution  formally  in  this  way  is  that,  for 


D, 


and 


(x. , <p)  = / cp(t)dt 
0 


<x2,cp)  = cp(0)  . 


In  general,  it  will  turn  out  that  the  solutions  to  (l)  contain  linear 
combinations  of  (8^}. 
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An  important  concept  in  the  development  of  solution  techniques 
for  (l)  is  that  of  convolution. 

If  f,  g: R -> R are  two  continuous  functions  that  vanish  on 
(-oo,0),  then  their  convolution,  f*g:R  -*  R is  defined  '>y 

t 

(f*g)(t)  = f f(t-s)  g(s)ds 
0 

Notice  that  f*g  = g*f.  This  definition  can  be  extended  to  the  case 
of  f and  g being  distributions  provided  f,  g 6 D|,  where 

C o'  denotes  the  space  of  distributions  'vanishing'  on  (-oo,0). 

By  this  we  mean  that  f € satisfies  (f,cp)  = 0 for  all  cp  € D 
vanishing  on  [0,<»).  See  [27,  p.  122]  for  further  details.  Of  importance 
here  will  be  the  result 

6*f  = f 

(6)  6<i>*f  . fU>  . 

for  f € D|. 

We  shall  be  applying  (6)  to  functions  of  the  form  l+(t)f(t) 
where  f;R-»R  is  infinitely  differentiable.  Clearly  (l+f)^ 
exists  only  in  the  distributional  sense.  It  can  be  shown  that 

(7)  (l+f)^  - l+f^  + 6f^1_1^(0)  + •••  + f(0)  . 
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C.2.  Laplace  Transforms 


The  Laplace  Transform  is  an  extremely  useful  tool  for  computing 

and  manipulating  solutions  to  equations  of  the  form  (l).  The  reason  for 

this  is  that  it  allows  us  formal  algebraic  manipulations,  the  results 

of  which  are  correct,  usually  requiring  no  further  justification. 

If  f : R — > R is  such  that  f(t)  =0,  t < 0,  and  for  some 
*c  t 

c € R,  e f(t)  is  absolutely  integrable  over  R,  then  the  Laplace 
transform  of  f is  defined  by 

f(s)  = /"  e'St  f(t)dt 
0 

where  s € (E,  Re  s > c.  The  alternative  notation  <£(f)(s)  will  also 
be  used. 

With  some  care,  this  definition  can  also  be  extended  to  distri- 
butions. 

For  f £ D ! , define 

f(s.'  = (f,  e_st) 

whenever  the  right-hand  side  makes  sense.  This  will  occur  when,  for 
some  c,  e"ct  f (t)  is  in  a certain  subspace  J 1 CD1  called 
distributions  of  slow  growth.  Any  such  f will  be  called  Laplace- 
transformable.  For  our  purpose  it  suffices  to  remark  that  the  family 
{6^}  is  Laplace-transformable,  and  moreover  that 

(8)  ^(6(1))  = s1  . 
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Other  Laplace  transform  formulae  that  will  be  important  are 


(9)  ^(p-  tk  l+(t))  = , Re  s > 0 

s 


(10)  /(p-  tk  eMt  l+(t))  = n^l  ’ Re  s > Re  u 

(s-p) 


(11)  f*g(s)  = f(s)  g(s) 


The  following  uniqueness  theorem  gives  a 1-1  correspondence 
between  Laplace-transformable  distributions  and  their  Laplace-transforms. 


(12)  Theorem.  (Uniqueness  Theorem).  Let  f and  g be  Laplace- 

/V  /V 

transformable  distributions  in  If  f(s)  = g(s)  on  some  vertical 

line  s = c + in  their  regions  of  convergence,  then  f = g. 


Proof.  See  [27,  p.  225].  □ 


C.3.  Solving  the  equation  Dx  + Lfx  = g 

With  the  above  results  we  can  now  proceed  to  solve 
t 

Dx(t)  + L / x(s)ds  = g(t),  t > 0 
0 


^ A A 

J'It  suffices  to  require  that  f(s)  = g(s)  over  some  segment  of  an 
arbitrarily  oriented  line  in  their  common  regions  of  convergence. 


in  the  space  of  distributions.  We  begin  by  rewriting  the  equation  as 


(W) 


(D6  + Ll+)*x  = lfg. 


xi  € K * 


Taking  Laplace  transforms  on  both  sides  (assuming  x is  Laplace- 
transformable)  yields 

(D  + | L)  x ( s ) =/(l+g)(s) 
by  using  (8),  (9)  and  (ll). 

Assume  now  that  (D  + i L)  ^ exists  for  some  s (and  hence 
for  all  except  finitely  many  s).  Then 


x ( s ) = (D  + i L)'1^(l+g)(s). 


From  (ll)  we  see  that  if  H = (lu  ..)  is  an  m x m matrix  with  entries 

h.  . € D|  and  whose  Laplace  transform  is  (D  + - L)”1,  then  x = H*(l  g). 

By  Cramer's  rule  for  computing  the  inverse  of  a matrix,  it  is 

clear  that  (D  + — L)  ^ = (r  (s))  has  entries  that  are  rational  functions 
s lj 

of  s.  Indeed  each  r^..  has  the  form 


ij 


w-Hf 


pu(s) 


»T" 


where  the  p^  and  q are  polynomials  in  s of  order  < m,  and 
q(s)  = det(sD  + L).  A partial  fraction  expansion  of  each  r^  yields 
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P *K  ^ 

(D  + J L)"1  = Z U sk  + Z Z V ^TI 

s k=0  K k=l  £=0  ^ (s-|J  L 


where  Z£=1  = degree  of  q,  Uk)  are  the  roots  of  q with  multi- 

plicities {n^),  and  the  matrices  {UjJ  C RmXm,  {V  } C I®  Xm. 

From  (8),  (10),  (14)  and  Theorem  (12)  we  obtain  (D  + ^ L)”1 
as  the  Laplace  transform  of 


m P | t 

(15)  H(t)  = Z U 8 k (t)  + Z Z V -i-  t£  e 1 (t)  . 

^ 1 n s\  " • 


k=l  £=0 


To  obtain  the  solution  x,  we  compute  the  convolution  H*(l^g). 
Using  (6)  and  (7)  yields 


m , . m-1  m , . . 

(16)  x(t)  = Z u.g  k (t)  + Z ( Z u.g(k-i-1,(o))  6(^(t) 

k=0  l=Q  k=£+l 


+ / Y(t-s)  g(s)ds, 
0 


t > 0 


where 


(17)  f(t)  = 


? "z  v JL  J 1 

k=l  «=0 


t > 0 


We  have  thus  established  the  following  result. 

^It  can  be  shown  that  ?(t)  £ RmX  n since  the  imaginary  parts  cancel 
in  complex  conjugate  pairs. 


••  ry* . 


(l8)  Proposition.  Let  D and 


L 6 R , and  g;R  -> R be  analytic. 
If  there  is  a p € R such  that  (pD  + L)  ^ exists,  then  the  unique 
solution  to  the  equation 
t 

Dx(t)  + L / x(s)ds  = g(t),  t > 0 

0 

in  the  space  is  given  by  (l6). 

Proof.  By  construction,  x given  by  (l6)  is  a particular  solution.  To 
show  it  is  the  unique  solution  in  D|,  suppose  that  y is  another 
solution.  Then  both  x and  y satisfy  (13).  By  construction  H(t) 
given  by  (15)  satisfies 

H*(D6  + Ll+)  = I& 


where  I is  the  m x m identity  matrix.  Convolving  H with  both  sides 
of  (13)  then  yields 

&*x  = H*(l+g) 

6*y  = H* ( l+g)  . 

By  (6),  it  follows  that  x = y.  □ 

We  remark  that  alternative  expressions  for  the  coefficients 
(U^)  and  'f'(t)  have  been  found  using  the  Drazin  inverse  of  a matrix. 
See  [ 5 ].  In  particular  it  is  shown  that  there  exist  matrices 
F,  G S pmXm  guch  that 

?(t)  = FeGt. 
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Since  these  expressions  yield  little  additional  insight  into  the  nature 
of  the  solutions,  we  shall  omit  them  here. 

Finally,  note  that  since  g is  assumed  analytic,  x is  the  sum 
of  an  analytic  function  and  a finite  linear  combination  of  {6^}.  In 
the  event  that  D 1 exists,  it  can  be  easily  shown  that  U,  = 0, 
k = 1, since  q then  has  degree  m.  In  this  case,  all  the  terms 
in  6 in  (lC)  vanish,  and  we  obtain  the  same  expression  as  in  (3). 


C. b.  An  initial  value  result 

(19)  Lemma.  if  gjR  R is  analytic  and  l+g  is  Laplace  transformable 
then 

/(£  aB^+lg)(s)  = £ vk+  Z g(j)(0)s-j 
k=0  K + k=0  * 0=0 

Proof.  Use  (8),  (9)  and  Theorem  8.3.3  in  [ 27>  p.  231].  □ 
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APPENDIX  D 


AN  EXISTENCE  AND  UNIQUENESS  THEOREM  RELATING  TO  THE  {T  } 

In  this  Appendix  we  study  the  equations 

+ ...  + (-l)q+1Zq  = p 

- * + (-l)q+1zq  = p 

zq  + •••  + (-l)q+1zq  = p 

on  the  simplex 

S = (z  € Rq:l  > z > •••  > z >0} 

for  given  0 < p < 1. 

These  equations  appear  in  Section  4.2.1. 

We  remark  that  they  are  closely  related  to  the  study  of  the 
Vandermonde  matrix  (see  e.g.  [17,  p.  36]) 


However,  the  usual  application  is  to  solve  a linear  system  of  equations 
whose  detached  coefficient  matrix  is  V(z),  for  some  given  fixed  z. 
Here  we  are  given  a solution 


to  the  equation 


y = 


V(z)y  = pe 


and  wish  to  determine  z. 

The  main  result  of  this  Appendix  is  the  following: 


(2)  Theorem.  The  system  of  equations  (l)  has  a unique  solution  in 
the  interior  of  the  simplex  S^. 

We  shall  require  several  results  for  the  proof  of  this  theorem. 


(5) 

(‘0 


Lemma.  For  any  p and  any  z £ Rq  satisfying  (l). 


^ ■ z2+1  + *•*  + (-l)q+V+1  = p{l  - IT  (l-zi)} 


i=l 


Proof.  Define  the  polynomial 

q 

f (oj)  = cn  TT  (cn  - z. ) 

i=l  1 


and  let  it  have  the  expansion 
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f (<d)  = CO 


1+1 


q 

a (o  + • • • + a,(n  . 

q i 


Multiply  the  ith  equation  in  (l)  by  ai  and  sum  over  i = 1, 
It  is  easily  seen  that  the  resulting  equation  is 


{f(Zl)  - zq+1}  + ...  + (-l)q+1{f(z  ) - Zq+1)  = o{ f (l)  - 1). 

M.  T. 

q 

Since  f(z. ) = 0 and  f(l)  = IT  (l  - z, ),  the  result  follows.  □ 

i=l  1 


(6)  Corollary.  For  0 < p < 1,  the  (q+l)  equations 

q 

Z (-l)k+1  z£  = p , j = 1,2,..., q+l 

k=l  K 

have  no  solution  in  the  interior  of  S . 

q 

Proof.  For  z in  the  interior  of  S^,  the  right-hand  side  of  (1*)  is 
< p,  and  the  result  follows  immediately.  □ 

(7)  Corollary.  Fcr  0 < p < 1,  equations  (l)  have  no  solution  on  the 
boundary  of  S^. 


Proof.  Let  z be  the  boundary  of  S 

the  form  of  such  z and  the  form  of 

1<"  I,  <•••<£  <q  such  that  (z. 
- 1 r - i 

S and  satisfies 
r 


^ and  satisfy  (l).  By  examining 

(l),  it  is  clear  that  there  are 

,...,z  ) is  in  the  interior  of 

r 
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**  i+1  i 

Z (-1)  z,  = P'  > j = 1,2,..., r+1 

1=1  1 i 

where  either  p'  = p or  p'  = 1-p.  By  Corollary  (6)  this  is  a 
contradiction.  □ 

(8)  Lemma.  Det  V(z)  = IT  IT  (z.  - z.)  . 

l<k<q  1 < i < j < q 3 1 

Proof.  See  [17,  p.  36],  □ 

(9)  Corollary.  Define  f:Rq  ->  Rq  by 

f (z)  = Z (-l)k+1  z j , j = l,...,q 

0 k=l  K 

Then 

det  Df (z)  = qj  (-l)q+1  IT  (z.  - z.) 

where  Df(z)  is  the  Jacobian  matrix  of  f(z). 

Proof.  This  follows  directly  from  the  Lemma  and  elementary  properties 
of  determinants.  □ 


13  ** 


i*  #«f 


ns* 


(10)  Lemma.  Let  £ R be  given.  If  the  equation 


yi  + y2  +•••+  yq  = rx 

2 2 2 
yl  + y2  +’”+  yq  = r2 


q a q 

yl  y2  yq 


yq=rq 


have  a solution,  then  this  solution  is  unique  up  to  a permutation 
of  the  variables. 


Proof.  This  follows  from  [4,  pp.  242-245].  □ 


Proof  of  Theorem  (2).  Existence:  The  proof  will  be  by  induction  on  q. 
For  the  case  q = 1 the  result  holds  trivally. 

Suppose  that  the  result  is  true  for  q,  and  that  z,  in  the 
interior  of  S^,  is  a solution.  Define  the  maps  H:S^  X [0,1]  ->  Rq 
and  h.*Sq  x [0,1]  -*  R by 


H (z,t)  = Z (-l)k+1  z^  + (-l)q  tJ  , j = 1, . . . , q 
J k=l  K 


h(z,t)  - Z (-l)k+1  zf1  + (-1)V+1 
k=l  K 


We  need  to  find  1 > z^  > • • • > z^  > t > 0 such  that  H(z,t)  = pe  and 
h(z,t)  = p. 


... 
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Now  H(z,0)  = pe,  and  by  Lemma  (3),  h(z,0)  < p since  z is  in 


the  interior  of  S . Also  by  Corollary  (9)  the  Jacobian,  D H(z,t)  is 
q z 

nonsingular  for  all  z in  the  interior  of  and  all  t.  By  the 

Implicit  Function  Theorem  [20,  p.  128]  there  is  a 6 > 0 such  that 
for  all  t € [0,5),  the  equation  H(z,t)  = pe  has  a solution  z(t) 
in  the  interior  of  S . Moreover  z(t)  varies  continuously  with  t. 

q 

We  now  show  that  we  can  increase  t from  0 to  some  t*  while 


maintaining  H(z(t),t)  = pe,  z(t)  in  the  interior  of  S^,  0 < t < z^(t), 

and  such  that  h(z(t*),t*)  = p.  If  this  is  possible  then  (z(t*),t*) 

will  be  in  the  interior  of  S and  will  satisfy  (l)  in  the  case  of 

q+1 

q replaced  by  q+1. 

Since  DzH(z,t)  is  nonsingular  for  all  z in  the  interior  of 
S , it  is  clear  that  the  only  time  we  can  no  longer  increase  t is 
when  either  t = z^(t)  or  z(t)  hits  the  boundary  of  S^.  In  the 
event  that  t = z^(t)  or  any  two  components  of  z(t)  are  equal. 


we  can  use  the  technique  of  the  proof  of  Corollary  (7)  to  obtain  a 
reduced  overdetermined  system  having  an  interior  point  solution.  By 
Corollary  (6)  this  leads  to  a contradiction.  The  case  remaining  is  when 
for  some  t',  z^t')  = 1 and  1 > z2(t')  > •••  > zq(t')  > t'  >0.  In 
this  case  (z2(f),  ...  , zq(t'),t’)  satisfies  (l)  with  the  right-hand 
side  replaced  by  1 - p.  By  Lemma  (3)  we  get 


Hence 


Z (-l)k  zf1  + (-l)q+1  t’q+1 

k=2 

q 

= (i-p)  [i  - (l-f)  n (1  - z. ))  . 

k=2 

q 

h(z(t'),t')  = 1 - (1-p)  [1  - (l-t*)  n (i-zj)  . 

k=2  K 
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Since  the  term  { } lies  in  (0, l)  we  have 


h(z(t'),t')  > p . 

However,  h(z(0),0)  < p,  and  we  would  have  reached  an  earlier  t*  < t' 
at  which 

h(z(t*), t*)  = p. 

This  completes  the  proof  that  the  result  holds  for  q+1.  By  induction 
the  existence  is  established. 


Uniqueness:  (This  proof  is  due  to  T.  J.  Rivlin.1)  Suppose  there 

are  two  solutions  z and  z.  Upon  rearrangement,  it  is  clear  that 
z and  z satisfy 


Z1  + Z2  + Z3  + 

2 -2  , 2 , 
Z1  + Z2  + Z3  + 


= \ + z2  + Zj  + ••• 

-2  2 -2 
= Z1  + z2  + z2  + 


By  Lemma  (10)  the  vector  (z^, z0, z^, . . . ) is  a permutation  of 
(z^z^z  ,...).  Since  both  z and  z lie  in  the  interior  of 
Sq,  it  follows  that  z ^ =z^,  i = 1, ...,q. 

This  completes  the  proof.  □ 

^Private  communication. 


APPENDIX  E 


EXAMPLES  OF  SOME  STEPS  IN  THE  CONTINUOUS  TIME  SIMPLEX  ALGORITHM 
(l).  Making  a change  of  basis  over  [0,e)  with  q = 2. 


t 

x (t)  + / x (s)ds  = 2 + 2t 
1 0 


t 

x (t)  + / x (s)ds  = 2 + t 
^ 0 ^ 


*5  + *5  ■ 2 

x-l  + x^  =2 

x±(t)  >0,  i = 1, . . . , 5,  t € [0,1]  • 


Since  we  are  using  this  example  only  to  illustrate  the  pro- 
cedure for  making  a change  of  basis  over  [0,e),  we  have  omitted  the 
objective  function. 

The  detached  coefficients  are: 


‘ 1 

0 

0 

0 

0 

■ 0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

B = 

0 

0 

1 

0 

1 

, K = 

0 

0 

0 

0 

0 

_ 1 

0 

0 

1 

0 _ 

_ 0 

0 

0 

0 

0 

b(t) 


2 + 2t 
2 + t 
2 
2 
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We  begin  with  the  initial  basic  feasible  solution  (3^  = (1,2,3,  5) 
over  [0,1],  This  yields 

Xl(t)  = x2(t)  = 2,  x3(t)  = x5(t)  = 1,  xu(t)  = 0 . 

Next  we  introduce  x^  = 8 into  the  basis  over  [0,e).  Follow- 
ing Section  4.2.1  the  first  step  is  to  find  the  partial  fraction 
expansion  of 

Ms)  - i K.^)-1  + i fcu)  . 

Substituting  yields 

'I 


whose  expansion  is 


Following  (21),  (29)  and  (33)  of  Section  4.2.1  we  find 


V Pl’  V P2 


by  taking  minimum  and  maximum  ratios  on  the  vectors 


x(0)  = (2,  2,  1,  0,  1)T 
y = (0,  0,  1,  0,  -1)T 

We  obtain 

r^  = 3,  P^  = 1,  r2  = 5,  Pg  = 4. 


Since  q = 2,  we  partition  the  interval  [0,s)  into  three  sub- 
intervals [0, tuc),  [t  e,  T^e ) , and  [f^e)  on  which  we  shall  have 
the  basic  sets  c^,  <j  , ct1  respectively,  where 

= (1, 2,4,5) 
a2  = (1,2, 3,4)  . 

To  obtain  and  we  begin  by  solving  the  equations  4.2.1 

(36)  for  z^  and  z^,  and  then  set  = 1 - z^,  = 1 - z2  as  our 

initial  guess.  Should  we  obtain 

e(e-|T°,T°)  = 0 

6(1)(€1t°,t°)  = 0 

then  these  will  be  the  desired  values  of  and  Now  z^  and 
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z?  satisfy 


(2) . Using  ambiguity  in  the  dual  variables  to  obtain  a proof  of  optimality 


1 

minimize  r x (t)dt 

t 

subject  to  >•  ( t ) + / x (s)ds  = 1 - t 

° d 

X2(t)  + Xj(t)  = 1 

xt(t)  > 0 , t c [0,1] 


The  detached  coefficients  are 


1 

H 

O 

O 

i 

1 

o 

H 

O 

B = 

, K = 

. 0 1 1 

0 0 0 

1 - t 


b(t)  = 


c(t)  =[100]. 


It  can  be  easily  seen  that  there  is  only  one  feasible  solution, 

given  by 

Xl(t)  = 1-t,  x2(t)  = 0,  x (t)  = 1 , t £ [0,1]  . 

Hencp  this  must  be  the  optimal  solution. 


lbj 


FIGURE  11 


Determining  the  dual  variables  by  solving  4.1(9)  in  the  usual 
way  yields 

A*(t)  = 1,  A*(t)  = 0 

V^.  = 0 , r = 1,2,  3 = 0,1  . 

The  reduced  costs  of  x0  are 

c2(t)  = t - 1 

3^  = o , J * o,i 

and  we  observe  that  c?(t)  < 0 for  0 < t < 1. 

Noting  that  x^(l”)=  0,  we  can  relax  the  restriction  d^  = 0, 
and  parametrize  the  dual  variables  in  terms  of  y*  = d^.  Solving  the 
equations  given  by  Lemma  4.5(85): 

144 


i 


■ 


ir*)  (J  [)  - / (<,  ir*)ds  (°  £)  + <£  uX  o)  = (0*0) 


yields 


Thus 


( ^ 0W1  0.  ,1  1\  /O  Ov  / i 

u2  0 1 (*■*]_>  ^2  0 0 ~ (^->0) 

(ur  4)(o  i}  = (0’0)' 


7T*(t)  = 7T2(t)  = 0 

u?  = 1,  all  other  = 0 . 

1 r 

xx(t|r)  = i,  ( 1 1 r ) = 0 

v°(r)  = r,  all  other  v'^(y)  = 0. 
i.  r 


The  reduced  costs  of 


now  become 


c2(t|r)  t - 1 - y 
d3r  = 0,  all  r,  j 


We  see  that  c2(t|y)  >0  for  y < -1.  Since  this  also  yields 

"0  # • 
d^  = y < 0 for  such  y,  our  dual  variables  X ('If)  and  {vJ(y)} 

are  dual  feasible  in  this  range,  as  required. 
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